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Description 

METHODS AND COMPOSITIONS FOR SALMONELLA-BASED VACCINES 

5 Cross-Reference To Related Applications 

This application is a continuation-in-part of United States patent 
application Serial No. 08/054,452, filed April 26, 1993 and presently pending. 

Technical Field 

10 The present invention relates generally to methods and compositions for 

eliciting an immune response, and more particularly to such compositions directed to, or 
based on, Salmonella. 

Background of the Invention 

15 In 1980, the World Health Organization estimated that in developing 

countries food poisoning from infection with Salmonella bacteria (salmonellosis) 
contributed in a major way to food borne infections resulting in more than 1 billion cases 
of acute diarrhea in children under the age of five years (Kvenberg and Archer, Food 
TechnoL 40:77-98, 1987, and at least 5 million deaths (this reference, and all other 

20 references cited herein, is hereby expressly incorporated herein by reference in its 
entirety). Since the mid-1980s, the worldwide incidence of salmonellosis has increased 
steadily. S. ententidis (also known as S. enterica sen Enteritidis), in particular, has 
been implicated in the sharp increase in food borne infection in industrialized countries 
since 1983. Indeed, the current frequency of SI enteritidis infections is considered to 

25 constitute a worldwide pandemic (Rodrigue et aL, Epidemiol Infect 705:21-27, 1990). 

The severity of the disease is greatest in infants, the elderly, the infirm 
and in other persons with inadequate or impaired immune systems, including the 
malnourished. In third world countries where malnutrition is more commonly a 
complicating factor, mortality rates due to S. enteritidis infection as high as 28% have 

30 been reported. In both the clinical and industrial settings, the. situation is also 
complicated by the feet that many people are asymptomatic carriers. Salmonella spp., 
including 5. enteritidis, often possess several plasmid encoded antibiotic resistance genes 
that complicate the treatment of human infections. 

In the industrialized world, it is the contamination of food products by 

35 Salmonella bacteria that is most directly threatening to human health. Hence, it is not 
surprising that the increase in salmonellosis in first world countries parallels the 
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centralization of food production and processing despite continued improvements in 
epidemiological and microbiological methods. 

The significance of the problem is reflected in one aspect in the poultry- 
related industries. For example, in the U.S. alone hatcheries produce approximately 100 
5 million broiler chicks per week and chicken egg production in the U.S. has reached 5 
billion annually. A large proportion of S. enteritidis infections have been associated with 
the contamination of the contents of whole shell eggs resulting from vertical 
transmission of this pathogen due to transovarian infection. This is significant since 
common procedures designed to decontaminate the external shell surface are not 

10 effective. The problem presented by S. enteritidis is exacerbated by the feet that 
infection in the adult laying hens may be asymptomatic. Typically, S. enteritidis 
infection of laying birds does not have a significant adverse effect on fertility, 
hatchability or egg production. Similarly, broiler chickens may be asymptomatic 
throughout their lifetime, although losses of about 20% do occur in infected flocks due 

IS to death in chicks, retardation of growth and rejection of contaminated birds at 
slaughtering. Contaminated poultry feed may be a major source of infection, but stress 
to poultry due to handling, transportation and overcrowding add to the problem by 
increasing the shedding of Salmonella from infected chickens. The end result is that the 
majority of modern processing plants, which process about 10,000 birds per hour, are 

20 contaminated and Salmonella are typically isolated from 40% to 70% of turkey or 
chicken carcasses sampled in the U.S. and Canada (Lammerding et al. J. Food 
Protection 57:47, 1986). 

The overall economic costs of the rising incidence of food borne 
infections have been significant. The U.S. General Accounting Office has recently 

25 estimated the cost of S. enteritidis food poisoning in the U.S. between 1985 to 1990 at 
$118 million in lost productivity, medical and hospital costs resulting from about 9,500 
illnesses. The U.S. Center for Disease Control receives more than 40,000 case reports 
annually but attributes greater than 2 million cases and roughly 2,000 deaths per year in 
the United States to salmonellosis (Cohen and Tauxe, Science 254:964-969, 1986). The 

30 economic cost related to treatment of salmonellosis in the U.S. was estimated to be S50 
million in 1986. About 8 million cases involve physician consultation and an estimated 
250,000 cases require hospitalization. Non-hospitalized cases are thought to have 
accounted for about $680 million in medical costs and minimally $2 billion in lost 
productivity. Others estimate the total costs of salmonellosis in the U.S. arising from 

35 medical treatment and lost productivity to be as high as S23 billion per year (Kvenberg 
and Archer, supra). 



A 



WO 94/25598 " PCT/EB94/00207 

3 

The losses absorbed by the food industry from liability and product loss 
are undoubtedly passed on to the consumer. Thus, there is a need for an effective risk- 
management program to monitor the different phases of poultry production including 
breeding, raising, slaughtering, packing and further processing, distribution and 
5 preparation, and consumption. The development of strategies for creating Salmonella- 
free feed, the control of Salmonella in breeder flocks, hatcheries, and product operations 
will include development of more effective diagnostics and vaccines. 

It is generally accepted that killed Salmonella vaccines are inferior to live 
attenuated Salmonella vaccines so this method has been more-or-less abandoned as a 

10 vaccine strategy. Several reports exist that explore humoral response of the host to 
Salmonella components but this strategy has not been widely adopted. Some examples 
include an outer membrane protein preparation with 4% LPS content, which reportedly 
gave some protection to S. typhimurium in mice (Isibasi et aL, Infect Immwt 56:2953, 
1988). However, Salmonella fimbriae have not been explored for use as subunit 

15 vaccines. 

Putative vaccines have been developed with S. typhi to prevent typhoid 
fever in humans. However, several non-typhoid Salmonella spp. which cause 
gastroenteritis in humans are host-adapted to cause a typhoid-like disease in certain 
a n i m a l s. Consequently, attenuating mutations that have been made in one strain have to 

20 be transferred to other Salmonella strains to create attenuated mutant vaccines 
responsive to host-specific diseases. In terms of vaccine design, four basic approaches 
have been attempted, namely killed vaccine strains, subunit vaccines using purified cell 
components to elicit protective antibody response, live attenuated strains and live 
attenuated strains expressing foreign proteins. 

25 Several putative vaccines have been devised that express foreign epitopes 

on the surface of cells of attenuated Salmonella. Some examples include immunogenic 
epitopes of hepatitis B surface antigens (Wu et aL, Proc. NatL Acad Set USA 86:4726- 
4730, 1989; Schodel et aL, in Progress in Hepatitis B Immunization, Coursget and 
Tong, eds., pp. 43-50, 1990), the major surface protein, gp63, of Leishmania (Young 

30 et aL, X Immunol. 745:2281-2285, 1990), the Streptococcal M protein (Poirer et aL, 
Exp. Med 163:25-32, 1988), and the 31 KDa protein of Brucella abortus (Statel et aL, 
Infect Immun 55:2048-2055, 1990). In some instances, foreign epitopes have been 
presented on flagella through incorporation of foreign DNA into the flagellin protein 
gene encoded on an expression vector. For example, a cholera toxin epitope has been 

35 expressed from a recombinant plasmid in a aroA m t flagellin* strain of 51 dublin (Newton 
etal., Science 244:10-12, 1989). However, none of these systems include the use of 
fimbriae or fimbrial gene products in the vaccines. 
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A live attenuated Salmonella vaccine that persists long enough in the 
host to elicit a long lasting high level immunity is desired. Several attenuating mutations 
have been identified that have been used in one or more Salmonella spp. singly or in 
combination for potential use as vaccines. Such vaccines have occasionally shown some 
5 protection in mice. 

However, there has gone unmet a need for vaccine technologies to 
reduce the problems associated with food borne, and other, salmonellosis. The present 
invention provides these and other related advantages. 

10 gummary qf the foventiqn 

In one aspect, the present invention provides a composition capable of 
eliciting an immune response, and preferably a vaccine, comprising an isolated AgfA 
protein in combination with a physiologically acceptable carrier or diluent In alternative 
embodiments, the composition comprises a SefB, SefC, SefD, TctC, TctB or TctA 

IS protein. 

In a further aspect, the present invention provides vector constructs 
comprising one or more of a mutant ictA gene, tctB gene, tciC gene or the tctl operon 
that is able to inactivate the corresponding tricarboxylic acid transport pathway in 
Salmonella, to yield an attenuated Salmonella. Preferably, the vector construct further 

20 comprises a mutant gene that inactivates the ability of Salmonella to utilize succinate. 
Alternatively, the vector construct comprising the mutant tctA, tctB, or tctC gene is used 
in combination with a vector construct that inactivates the ability of Salmonella to 
utilize succinate. This aspect of the invention therefore provides a biologically pure, 
attenuated Salmonella comprising a mutation in its ictA, tctB, or tctC gene, and/or a 

25 gene required for succinate utilizatioa 

This aspect of the invention further provides a composition capable of 
eliciting an immune response, and preferably a vaccine, comprising an attenuated 
Salmonella that has an inactivating mutation in one or more of its tctA f tctB, or tctC 
genes in combination with a physiologically acceptable carrier or diluent. Alternatively, 

30 the composition comprises an attenuated Salmonella having an inactivating mutation in 
one or more of its tctl operon (tctDCBA\ tctll operon, or tctHl operon. As a further 
alternative, the Salmonella, either in combination with an attenuating mutation as above 
or instead of such a mutation, has a mutation in two or more fimbrin encoding genes, 
such as sefA, sefD t qfgA or fimA, wherein the mutation effectively prevents production 

35 of fimbriae from such genes. 

In preferred embodiments, the attenuated Salmonella is able to express a 
foreign antigen in one or more of its fimbriae. Alternatively, the foreign antigen is fused 
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to a SefA, SefD, SefC, TctA or AgfA protein. The present invention also provides such 
attenuated Salmonella in a biologically pure form. 

In yet a further aspect, the present invention provides an expression 
vector construct comprising an agfA gene that is operably fused in an open leading 
5 frame to a foreign gene to yield a diristronic gene product, the dicistronic gene product 
able to be expressed in a fimbria of a Salmonella or an aggregate comprising such gene 
product. By "foreign gene" it is meant a gene that is not typically expressed in open 
reading frame immediately after the agfA gene, and preferably a gene that is not found in 
Salmonella in the wild state. Such a foreign gene preferably provides a foreign antigen 

10 or epitope, including an immunologically active fragment thereof. In alternative 
embodiments, the expression vector construct comprises a tctA gene, tctB gene, tctC 
gene, sefA gene, sefB gene, seJC gene or an sefD gene. In a further alternative 
embodiment, the fimbria or aggregate is expressed in an E coli, or in a Shigella spp. In 
still another alternative embodiment, the expression vector construct comprises an agfA 

15 gene, sefA gene or a sejD gene able to produce in E coli, Shigella or other 
Enterobacteriaeeae a stable fimbria that comprises AgfA protein and/or SefD protein. 

In still a further aspect, the present invention provides a stable fimbria or 
aggregate comprising an AgfA, SefA, SefD, SefC, SefB, TctC, TctB or TctA protein 
fused to one or more foreign antigens. In an alternative embodiment, this aspect of the 

20 present invention provides a stable amino acid polymer comprising an AgfA, SefA, 
SefD, SefC, SefB, TctC,. TctB or TctA protein fused to one or more foreign antigens. 
Within the context of the present invention, such a stable amino acid polymer, preferably 
formed in part of AgfA, is able to pass as a naked protein through the stomach of an 
animal without significant degradation 

25 In yet a further aspect, the present invention provides methods of 

eliciting an immune response in, and preferably vaccinating, an animal. In this aspect, 
fimbriae or aggregates comprising an AgfA, SefA, SefD, SefC, SefB, TctC, TctB or 
TctA protein fused to a foreign antigen are separated from their Salmonella host cell, 
and then introduced into the animal in conjunction with a physiologically, or 

30 pharmaceuticaliy, acceptable carrier or diluent. In an alternative embodiment, the host 
cell is used to grow a stable amino acid polymer comprising one or more of an AgfA, 
SefA, SefD, SefC, SefB, TctC, TctB or TctA protein fused to a foreign antigen, which 
polymer may not comprise a fimbria. In a further alternative embodiment, the fimbriae 
or aggregate are separated from the host cell prior to being introduced into the animal. 

35 In still, a further alternative embodiment, the fimbria or amino acid 

polymer comprises an AgfA, SefA, SefD, SefC, SefB, TctC, TctB or TctA protein and 
are grown in an E coli host cell or a Shigella host cell Alternatively, the host cell for 
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the expression vector construct is Citrobacter, Enterobacteria, Pseudomonas, 
Streptomyces, Bacillus, or Staphylococcus aureus. Preferably, the AgfA, SefA, SefD, 
SefC, SefB, TctC, TctB or TctA protein of these alternative embodiments is further 
fused to a foreign antigen. 
5 Preferably, the fimbriae or amino acid polymers are introduced orally or 

via injection to the animal. Further preferably, the fimbriae or amino acid polymers are 
packaged in protein or biodegradable polymers or copolymers, such as polylactide, 
polygalactide, polycaprolactone, polyanhydride, polyorthoesters, to form microspheres, 
or are bound to a substrate protein or bacterium. The fimbriae or amino acid polymers 

10 may also be maintained in an array. 

In yet a further aspect, the present invention provides a method of 
eliciting an immune response in, and preferably vaccinating, an animal comprising 
introducing an isolated AgfA protein in combination with a physiologically acceptable 
carrier into the animal. Preferably, the animal for this method of vaccination, and all 

15 other methods of vaccination as described herein, is a human being. In alternative 
preferred embodiments, the animal is a warm-blooded animal, and further preferably; a 
commercially important warm-blooded animal, including a fowl, a pig, a horse, a dog, a 
cat, or a cow. In further alternative embodiments, the animal is another commercially 
important animal such as a shellfish. In alternative embodiments, the isolated protein is a 

20 SefD, SefC, SeflB, TctC, TctB or a TctA protein. 

In still yet another aspect, the present invention provides methods of 
eliciting an immune response in, and preferably vaccinating, an animal comprising 
introducing attenuated Salmonella into the animal. The attenuated Salmonella may 
comprise one or more of an ineffective tctA gene, tctB gene, tctC gene, tctl operon, tcfH 

25 operon, or tctUl operon, in one embodiment in combination with mutations of the agfA, 
seJB, seJD or fimA. As noted above, the attenuated Salmonella may alternatively 
comprise two or more ineffective fimbriae genes. In preferred embodiments, the 
attenuated Salmonella expresses one or more foreign antigens. In a still further 
preferred embodiment, the foreign antigen is located on a SEF17 or SEF18 fimbria of 

30 the attenuated Salmonella, 

It is a further feature of this aspect of the invention that the immune 
response of the animal can be induced by introducing an£ coli into the animal, wherein 
the £ coli expresses a Salmonella fimbria. Preferably, the Salmonella fimbria 
comprises an agfA gene product, a se/A gene product or a sefD gene product. Further 

35 preferably, the Salmonella fimbria further comprises a foreign antigen. Alternatively, a 
Shigella is used in place of the E. coli. Further alternatively, another 
Enterobacteriaceae is used in place of £ coli. 
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In yet a further aspect, the present invention provides a method of 
eliciting an immune response in, and preferably vaccinating, an animal wherein a nucleic 
acid vector is introduced into the animal. The vector comprises a sefA gene, sefD gene, 
sefC gene, sefB gene, tctC gene, tctB gene, tctA gene, or agfA gene. Further preferably, 
5 the vector is naked and is injected into a muscle of the animal and/or the above-listed 
gene is fused to a further, antigen-producing gene to provide an antigenic fusion gene 
product. 

These and other aspects of the present invention will become evident 
upon reference to the following detailed description and attached drawings. In addition, 
10 various references are set forth below which describe in more detail certain procedures 
or compositions (e.g., plasmids, etc.); such references are incorporated by reference in 
their entirety. 

Brief Description of the Drawings 
15 Figure 1 is a schematic illustration and open reading frame analysis of the 

sefU2U [ABCD gene cluster. 

Figures 2A-2D depict the nucleotide sequence of sefA, sefB, sefC and 
sefD, and the corresponding predicted amino add sequences. 

Figures 3A-3B depict the nucleotide sequence of sefUj and sefU2* and 
20 the corresponding predicted amino acid sequences 

Figures 4A-4B depict the nucleotide sequence of tctA, and the 
corresponding predicted amino acid sequence. 

Figure 5 depicts the nucleotide sequence of tctB, and the corresponding 
predicted amino acid sequence. 
25 Figures 6A-6B depict the nucleotide sequence of tctC, and the 

corresponding predicted amino acid sequence. 

Figure 7A depicts the nucleotide sequence of an agfA gene fragment 
amplified from S. enteritis 27655-3b TvphoA mutant strain and cloned into pUC19, and 
the corresponding predicted amino acid sequence. The solid arrows indicate PCR 
30 primer pairs TAF3 and TAF4; the dashed arrows indicate TAF5 and TAF6. 

Figure 7B depicts the nucleotide sequence of agfA, and the 
corresponding predicted amino acid sequence. 

Figure 8 A depicts an autoradiograph of the results of expression of the 
sefA, sefB and sefC genes in an £ coli in vitro transcripti n-translation system Lane I, 
35 pTZ19; Lane 2, pKXi; Lane 3, pSCl; Lane 4, delBIS; Lane 5, deIB23; Lane 6, deEDlO; 
Lane 7, Western blot of the in vitro transcription-translation of pKXl developed using 
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antisera generated against denatured SEF14 fimbria. The size of the molecular weight 
markers is indicated on the left (10 3 M r ). 

Figure 8B is a schematic representation of the se/gene cluster showing 
the inserts of various deletion subclones used in the in vitro transcription-translation 
5 experiments. s 

Figure 9 depicts an autoradiograph of the mapping of the 5' end of the 
se/A transcript using primer extension. The lane labeled *& enteritidis" represents the 
reverse transcriptase products ofRNA isolated from this organism grown in CFA static 
broth for 60 hours at 37°C. The lanes C, A, T, and G represent the results of dideoxy 
10 chain termination sequence reactions in the region encompassing the promoter The 
sequence of the y- 32 P-labeled primer (TGCGTGGGCACTGCCACA) (SEQ ID No. 

) is complementary to nucleotides 181-198 of sefA. The arrows indicate two 

major transcription initiation sites. 

Figures IOA-C depict immunoelectron microscopy of negatively stained 
15 cells for SEF14 production. (A) 51 enteritidis 27655-3b; (B)£. coli HB101 carrying 
cos48; (C)£ coli JM109 carrying pKXL Magnification: (A) xl 15,000; (B)x94,000; 
(C) xl44,000. 

Detailed Description of the Invention 
20 The present invention provides methods and compositions for eliciting an 

immune response from vaccines to Salmonella. These methods and compositions 
include numerous isolated genes specific to Salmonella, vector constructs, numerous 
isolated proteins specific to Salmonella and vaccines. These methods and compositions 
are described further, below. 

25 

I. Genes Specific to Salmonella 

A. Genes Generally 

The present invention provides isolated DNA molecules comprising the 

30 sefU2U jABCD gene cluster, the sefABCD gene cluster, se/BCD gene cluster, seffiU / 
gene cluster, the se/A gene, the sefB gene, the sefD gene, the agfA gene, the tctCBA 
gene cluster, the tctA gene, the tctB gene, and/or the tctC gene. Although one 
embodiment of each of these molecules is shown in Figures 2 to 7B, it should be 
understood that within the context of the present invention, reference to one or more of 

35 these genes includes derivatives of the genes that are substantially similar to the genes 
(and, where appropriate, the protein (including peptides and polypeptides) that are 
encoded by the genes and their derivatives). As used herein, a nucleotide sequence is 
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deemed to be "substantially similar" if: (a) the DNA sequence is derived from the 
coding region of a native gene of any Salmonella serovar and maintains substantially the 
same biological activity (including, for example, portions of the sequence or allelic 
variations of the sequences discussed above); (b) the DNA sequence is capable of 
5 hybridization to DNA sequences of the present invention under moderate, high or very 
high stringency (see Sambrook et al., Molecular Cloning: A Laboratory Manual, 2d 
ed., Cold Spring Harbor Laboratory Press, NY, 1989); or (c) the DNA sequences are 
degenerate as a result of the genetic code to the DNA sequences defined in (a) or (b). 
Within the context of the present invention, moderate stringency means conditions such 

10 that an appropriate nucleotide sequence is able to selectively hybridize to nucleotide 
sequences from Salmonella and to conserved nucleotide sequences in other bacteria 
such as K coli. High stringency means the nucleotide sequence is able to selectively 
hybridize to nucleotide sequences from Salmonella. Very high stringency means the 
nucleotide sequence is able to selectively hybridize to a single Salmonella species, such 

15 as S. enteritidxs, S. dublin or S. gallinarum. 

B. sefA. sefB. sefC sefD. sefU j . sety genes and the sefU^ J jABCD 
operon 

As can be seen by reference to Figure 1, which is a schematic illustration 

20 and open reading frame analysis of the sefl/2U [ABCD gene cluster, the sefA, sefB, and 
sejC genes comprise an approximately 3.9 kilobase pair region of Salmonella DNA. In 
vitro expression directed by the 5.3 kilobase pair DNA fragment of Figure 1 indicated 
that the SefA, SefB and SefC proteins have an approximately 14 K, 28 K, and 90 KA/r 
molecular weight, respectively. See Figure 8 A The present invention involves one or 

25 more of a sefA, sejB, seJC, se/D, sefUj, or sefU2 gene or gene cluster. Further 
information with respect to these genes and their products may be found in U.S. 

Application Ser. No. 08/054,542. See also Application Ser. No. (attorney's 

docket no. 920043.403C1), and Application Ser. No. (attorney's docket 

no. 920043.403C2). As noted above, these applications and all other references cited 

30 herein are expressly incorporated by reference herein in their entirety. 

Experimental results with S. enteritidis indicate that sefB and sefC are 
not expressed in the absence of sefA. Primer extension analysis of sefABC gene clusters 
revealed two major transcription start cites located upstream of sefA (Figure 9). 
Transcription of sefB and sefC is also initiated from the sefA promoter region. 

35 Secondary structure analysis of the mRNA transcript from sefABC predicted the 
formation of two stable stem-loop structures in the intercistronic region between sefA 
and sefB, which is indicative of differential regulation of sefA as opposed to sefB and 
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seJC translation. The nucleotide sequences, and corresponding amino acid sequences, of 
sefA % seJB, seJC, sefD, sefU / and se/t/? are depicted in Figures 2 and 3. 

se/Uj and se/U2 are overlapping open reading frames oriented in the 
opposite directions (/.£., encoded on opposite strands). seJD abuts sefC and a consensus 
5 Shine-Dalgarno ribosomal binding she sequence is just inside the sejC or f. No promoter 
is recognized immediately upstream of sefD', expression may be directed by the sefA 
promoter. 

C. agfA gene 

10 The agfA gene codes for a structural fimbrin protein composing very thin 

fimbriae, approximately 3-4 nanometers in diameter, that are highly aggregative and 
stable. The aggregative property of the fimbriae is believed to be due at least in part to 
its hydrophobicity. The gene product of the agfA gene, AgfA, is found in SEF17 
fimbriae, and comprises an approximately 14-15 KMr molecular weight fimbrin protein. 

15 AgfA contributes to heavy pellicle formation in static cultures, colony hydrophobicity, 
and autoaggregation of cells in culture. SEF17 fimbriae are immunologically distinct 
from SEF14 and SEF21, as indicated by the lack of cross-reactivity with polyclonal 
antisera raised in rabbits against SEF14 or SEF21 (Collinson et ai., "Purification and 
characterization of thin, aggregative fimbriae from S. enteritidis^ J. Bacterial. 

20 773:4773-478 1, 1991). Figure 7A depicts the nucleotide sequence of an agfA gene 
fragment that was amplified from the S. enteritidis 27655-3b TvphoA mutant strain and 
then cloned into pUC19. The bases underlined in the agfA sequence of Figure 7A are 
common to portions of the PGR primers TAF1 and TAF2, which were used in 
amplification of this fragment. Below the agfA sequence in Figure 7 A is the translated 

25 amino acid sequence. As discussed further below, the nucleotide sequences targeted by 
the diagnostic PCR primer pairs TAF3 and TAF4 (solid arrows) and TAF5 and TAF6 
(dashed arrows) are also indicated in Figures 3A-3B. Below the agfA sequence in 
Figure 7 is the translated amino acid sequence. 

Figure 7B depicts the nucleotide and amino acid sequences of the full 

30 agfA gene of S. enteritidis 27655-3b. 

D. tctA. tctB, and tctC genes, and the tctl tall and tctlll operons 

The tctA, tctB, tctC and tctD genes are located in the tctl operon, which 
is one of three operons for tricarboxylic acid transport within Salmonella spp. The 
35 other Salmonella tricarboxylic acid transport systems are termed tctll and tctlll. It is 
believed that the tricarboxylic acid transport system of Salmonella is lacking from 
several other related genera o£Enterobacteriaceae. tctl and tctlll both encode proteins 
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responsible for citrate transport and are induced by growth of Salmonella typhimurhmt 
on minimal media supplemented with citrate as a carbon source, tctll is normally not 
expressed. Each system demonstrates preferential transport of various tricarboxylic 
acids and different concentrations of the monovalent cations Na + and K + . The transport 
5 of tricarboxylic acids, particularly citrate, are potentially important to the intracellular 
survival of Salmonella, as such transport provides a means of scavenging nutrients from 
a host cell. Accordingly, mutants in these various tct systems, preferably in 
S. typhimurium or 5. enteritidis, potentially create attenuated strains of Salmonella 
capable of being taken into a host cell, and capable of persisting for a limited time, yet 
10 not able to proliferate within such a host celL In a preferred embodiment, one or more 
of these tct mutants are coupled with a mutant unable to utilize succinate to provide a 
"back-up" system to assure attenuation. 

The nucleotide sequences for tctA. tctB and tctC, along with their 
corresponding amino acids, are depicted in Figures 4, 5 and 6, respectively. 

15 

II. Vector Constructs Comprising the Gene Sequences of the Present Invention 

A. Vector Constructs Generally 

The present invention provides for the manipulation and expression of 

20 the above described genes by culturing host cells containing a construct capable of 
expressing the above-described genes, including substantially similar derivatives thereof 
Numerous vector constructs, including all or part of the nucleotide 
sequences of a native or derivative sefA, sefB, sejC, sejD, sefU^ se/U /, agfA, tctA 9 
tctB, and/or tctC genes, as described above, can be prepared as a matter of convenience. 

25 Within the context of the present invention, a DNA construct is understood to refer to a 
DNA molecule, or a clone of such a molecule (either single-stranded or double- 
stranded), that has been modified through human intervention to contain segments of 
DNA combined and juxtaposed in a manner that as a whole would not otherwise exist in 
nature. Vector constructs of the present invention comprise a first DNA segment 

30 encoding one or more of the se/A, sef3 f seJC, sefD, se/U], sefUj* GgfA, tctA, tctB y 
and/or tctC genes operably linked to additional DNA segments required for the 
expression of the first DNA segment. Within the context of the present invention, 
additional DNA segments will include a promoter and will generally include 
transcription terminators, and may further include enhancers and other dements. 

35 Mutations in nucleotide sequences constructed for expression of variant 

proteins preferably preserve the reading frame of the coding sequences. Furthermore, 
the mutations will preferably not create complementary regions that could hybridize to 
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produce secondary mRNA structures, such as loops or hairpins, that would adversely 
affect translation of the mRNA. Although a mutation site may be predetermined, it is 
not necessary that the nature of the mutation per se be predetermined. For example, in 
order to select for optimum characteristics of mutants at a given site, random 
5 mutagenesis may be conducted at the target codon and the expressed mutants screened 
for indicative biological activity. 

Mutations may be introduced at particular loci by synthesizing 
oligonucleotides containing a mutant sequence, flanked by restriction sites enabling 
ligation to fragments of the native sequence. Following ligation, the resulting 
10 reconstructed sequence encodes a derivative having the desired amino acid insertion, 
substitution, or deletion. 

Alternatively, oligonucleotide-directed site-specific mutagenesis 
procedures may be employed to provide an altered gene having particular codons altered 
according to the substitution, deletion, or insertion required. Exemplary methods of 
15 making the alterations set forth above are disclosed by Walder etaL (Gene 42:133, 
1986); Bauer et aL {Gene 37:73, 1985); Craik (BioTectmiques, January 1985, 12-19); 
Smith et aL (Genetic Engineering: Principles and Methods, Plenum Press, 1981); and 
Sambrook et al. (supra). 

The primary amino acid structure of the above described proteins may 
20 also be modified by forming covalent or aggregative conjugates with other chemical 
moieties, such as glycosyl groups, lipids, phosphate, acetyl groups, or with other 
proteins or polypeptides. 

Within a further embodiment, the above described proteins may be fused 
with other peptides that facilitate purification or identification of these proteins. For 
25 example, a protein can be prepared as a fusion protein with the FLAG polypeptide 
sequence (see U.S. Patent No. 4,851,341; see also Hopp et al., Bio/Technology tf:1204, 
1988). The FLAG polypeptide sequence is highly antigenic and provides an epitope for 
binding by a specific monoclonal antibody, enabling rapid purification of the expressed 
recombinant protein. This sequence is also specifically cleaved by bovine mucosal 
30 enterokinase at the residue immediately following the Asp-Lys pairing 

B. Expression Vectors 

One type of vector construct, known as an expression vector, can contain DNA 
segments necessary to direct the secretion of a polypeptide of interest. Such DNA 
35 segments can include at least one secretory signal sequence. Preferred secretory signals 
include the yeast alpha factor signal sequence (pre-pro sequence; Kuijan and 
Herskowitz, Cell 30:933-943, 1982; Kuijan etaL, U.S. Patent No. 4,546,082; Brake, 
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EP 116,201), the Pho5 signal sequence (Beck etaL, WO 86/00637), the Suc2 signal 
sequence (Carlson et at., Mol Cell Biol 3:439-447, 1983), the a-2 plasmin inhibitor 
signal sequence (Tone etaL, I Biochem. (Tokyo) 102: 1033-1042, 1987), the tissue 
plasminogen activator signal sequence (Pennica et al., Nature 307:214-221, 1983), the 
5 K coli PhoA signal sequence (Yuan et al., J. Biol. Chem. 255:13528-13552, 1990), or 
any of the other bacterial signal sequences known in the art, such as those reviewed by 
Oliver {Ann. Rev. Microbiol 39:615-649, 1985). Alternatively, a secretory signal 
sequence can be synthesized according to the rales established, for example, by von 
Heinje (Eur. J. Biochem. 733:17-21, 1983;/ Mol Biol W:99-105, 1985; Nuc Acids 
10 Res. 74:4683-4690, 1986). Secretory signal sequences can be used singly or in 
combination. 

For expression, a DNA molecule as described above is inserted into a 
suitable vector construct, which in turn is used to transform or transfect appropriate host 
cells for expression. Host cells suitable for use in practicing the present invention 

15 include mammalian, avian, plant, insect, bacterial and fungal cells. Preferred eukaryotic 
cells include cultured mCT ma ^ fln cell lines (e.g., rodent or human cell lines) and fungal 
cells, including species of yeast (e.g., Saccharomyces spp., particularly S. cerevisiae, 
Schizosaccharomyces spp., or Uuyveromyces spp.) or filamentous fungi (e.g., 
Aspergillus spp., Neurospora spp.). Strains of the yeast Saccharomyces cerevisiae are 

20 particularly preferred. Methods for producing recombinant proteins in a variety of 
prokaryotic and eukaryotic host cells are generally known in the art (see, "Gene 
Expression Technology," Methods in Enzymology, Vol 185, Goeddel (ed), Academic 
Press, San Diego, Calif, 1990; see also, "Guide to Yeast Genetics and Molecular 
Biology/ Methods m Enzymology, Guthrie and Fink (eds.), Academic Press, San Diego, 

25 Calif, 1991). In general, a host cell will be selected on the basis of its ability to produce 
the protein of interest at a high level or its ability to cany out at least some of the 
processing steps necessary for the biological activity of the protein. In this way, the 
number of cloned DNA sequences that must be introduced into the host cell can be 
minimized and overall yield of biologically active protein can be maximized. 

30 Suitable yeast vectors for use in the present invention include YRp7 

(Struhl et al., Proa Natl Acad Sci. USA 76:1035-1039, 1978), YEpI3 (Broach et aL, 
Gene 5:121-133, 1979), pJDB249 and pJDB219 (Beggs, Nature 275:104-108, 1978) 
and derivatives thereof. Such vectors will generally include a selectable marker, which 
may be one of any number of genes that exhibit a dominant phenotype for which a 

35 phenotypic assay exists to enable transformants to be selected. Preferred selectable 
markers are those that complement host cell aux trophy, provide antibiotic resistance or 
enable a cell to utilize specific carbon sources, and include leu2 (Broach et al., ibid), 
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ura3 (Botstein et al., Gene 8:11, 1979), or his3 (Struhl et aL, ibid). Another suitable 
selectable marker is the cat gene, which confers chloramphenicol resistance on yeast 
cells. 

Preferred promoters for use in yeast include promoters from yeast 
5 glycolytic genes (Hitzeman et al., /. Biol. Chem. 255:12073-12080, 1980; Alber and 
Kawasaki, J. Mol Appl Genet 7:419-434, 1982) or alcohol dehydrogenase genes 
(Young etal., in Genetic Engineering of Microorganisms for Chemicals, Hollaender 
et aL (eds.j, p. 355, Plenum, New York, 1982; Ammeter, Metk Enzymol. 707:192-201, 

1983) . The expression units may also include a transcriptional terminator. 

10 In addition to yeast, proteins of the present invention can be expressed in 

filamentous fungi, for example, strains of the fungi Aspergillus (McKnight et al., U.S. 
Patent No. 4,935,349, which is incorporated herein by reference). Examples of useful 
promoters include those derived from Aspergillus niaulans glycolytic genes, such as the 
adh3 promoter (McKnight et aL, EMBO J. 4:2093-2099, 1985). An example of a 

15 suitable terminator is the adh3 terminator (McKnight etal., ibid., 1985). The 
expression units utilizing such components are cloned into vectors that are capable of 
insertion into the chromosomal DNA of Aspergillus. 

Techniques for transforming fungi are well known in the literature, and 
have been described, for instance, by Beggs (ibid.), Hinnen et aL (Proc. NatL Acad Set 

20 USA 75:1929-1933, 1978), Yelton etal. (Proc. Natl. Acad Sci. USA 57:1740-1747, 

1984) , and Russell (Nature 507:167-169, 1983). The genotype of the host cell wffl 
generally contain a genetic defect that is complemented by the selectable marker present 
on the expression vector. Choice of a particular host and selectable marker is well 
within the level of ordinary skill in the art 

25 In addition to fungal cells, cultured mammalian cells may be used as host 

cells within the present invention. Preferred cultured mammalian cells for use in the 
present invention include the COS-l (ATCC No. CRL 1650), COS-7 (ATCC No. CRL 
1651), BHK (ATCC No. CRL 1632), and 293 (ATCC No. CRL 1573; Graham et al., J. 
Gen. Virol. 56:59-72, 1977) cell lines. A preferred BHK cell line is the BHK 570 cell 

30 line (deposited with the American Type Culture Collection under accession number CRL 
10314). In addition, a number of other mammalian cell lines may be used within the 
present invention, including Rat Hep I (ATCC No. CRL 1600), Rat Hep H (ATCC No. 
CRL 1548), TCMK (ATCC No. CCL 139), Human lung (ATCC No. CCL 75.1), 
Human hepatoma (ATCC No. HTB-52), Hep G2 (ATCC No. HB 8065), Mouse liver 

35 (ATCC No. CCL 29. 1), NCTC 1469 (ATCC No. CCL 9.1), SP2/0-Agl4 (ATCC No. 
1581), HIT-T15 (ATCC No. CRL 1777), and RINm 5AHT 2 B (Orskov and Nielson, 
FEBS 229(1): 175-1 78, 1988). 
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Mammalian expression vectors for use in carrying out the present 
invention will include a promoter capable of directing the transcription of a cloned gene 
or cDNA Preferred promoters include viral promoters and cellular promoters. Viral 
promoters include the immediate early cytomegalovirus promoter (Boshart et al, Cell 
5 47:521-530, 1985) and the SV40 promoter (Subramani etal, Mol Cell Biol 7:854- 
864, 1981). Cellular promoters include the mouse metallothionein-1 promoter (Palmiter 
et al., U.S. Patent No. 4,579,821), a mouse Vj promoter (Bergman et al., Proc. Natl 
Acad Set USA 57:7041-7045, 1983; Grant et al., Nuc. Acids Res. 75:5496, 1987) and a 
mouse Vh promoter (Loh et al., Cell J3:85-93, 1983). Such expression vectors may 

10 also contain a set of RNA splice sites located downstream from the promoter and 
upstream from the DNA sequence encoding the peptide or protein of interest. Preferred 
RNA splice sites may be obtained from adenovirus and/or immunoglobulin genes. Also 
contained in the expression vectors is a polyadenylation signal located downstream of 
the coding sequence of interest. Suitable polyadenylation signals include the early or 

15 late polyadenylation signals from SV40 (Kaufman and Sharp, ibid\ the polyadenylation 
signal from the Adenovirus 5 E1B region and the human growth hormone gene 
terminator (DeNoto et al., Nuc. Acids Res. £3719-3730, 1981). The expression vectors 
may include a noncoding viral leader sequence, such as the Adenovirus 2 tripartite 
leader, located between the promoter and the RNA splice sites. Preferred vectors may 

20 also include enhancer sequences, such as the SV40 enhancer and the mouse 1 enhancer 
(Gillies, Cell 35:717-728, 1983). Expression vectors may also include sequences 
encoding the adenovirus VA RNAs. Suitable expression vectors can be obtained from 
commercial sources (e.g., Stratagene, La Jolla, Calif.). 

Vector constructs comprising cloned DNA sequences can be introduced 

25 into cultured mammalian cells by, for example, calcium phosphate-mediated transfection 
(Wigler et al., Cell 14:125, 1978; Corsaro and Pearson, Somatic Cell Genetics 7:603, 
1981; Graham and Van der Eb, Virology 52:456, 1973), electroporation (Neumann 
etal., EMBO J. 7:841-845, 1982), or DEAE-dextran mediated transfection (Ausubel 
et al. (eds.), Current Protocols in Molecular Biology, John Wiley and Sons, Inc., NY, 

30 1987), which are incorporated herein by reference. To identify cells that have stably 
integrated the cloned DNA, a selectable marker is generally introduced into the cells 
along with the gene or cDNA of interest Preferred selectable markers for use in 
cultured mammalian ceils include genes that confer resistance to drugs, such as 
neomycin, hygromycin, and methotrexate. The selectable marker may be an amplifiabie 

35 selectable marker. Preferred amplifiabie selectable markers are the DHFR gene and the 
neomycin resistance gene. Selectable markers are reviewed by Thilly (Mammalian Cell 
Technology, Butterworth Publishers, Stoneham, MA, which is incorp rated herein by 
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reference). The choice of selectable markers is well within the level of ordinary skill in 
the art. 

As discussed further below, naked vector constructs can also be taken up 
by muscular cells subsequent to injection into the muscle of a mammal (or other 
5 animals). 

Selectable markers may be introduced into the cell on a separate vector 
at the same time as the sefA, B, sefC t sefD, agfA, tctA, tctB, and/or tctC genes 
sequences, or they may be introduced on the same vector. If on the same vector, the 
selectable marker and the sefA, sejB, sejC, sefD, agfA, tctA, tctB, and/or tctC genes 

10 sequences may be under the control of different promoters or the same promoter, the 
latter arrangement producing a dicistronic message. Constructs of this type are known 
in the art (for example, Levinson and Simonsen, U.S. Patent No. 4,713,339). It can also 
be advantageous to add additional DNA, known as "carrier DNA," to the mixture that is 
introduced into the cells. 

15 Mammalian cells containing a suitable vector are allowed to grow for a 

period of time, typically 1-2 days, to begin expressing the DNA sequence(s) of interest 
Drug selection is then applied to select for growth of cells that are expressing the 
selectable marker in a stable fashion. For cells that have been txansfected with an 
amplifiable, selectable marker the drug concentration may be increased in a stepwise 

20 maimer to select for increased copy number of the cloned sequences, thereby increasing 
expression levels. Cells expressing the introduced sequences are selected and screened 
for production of the protein of interest in the desired form or at the desired level. Cells 
that satisfy these criteria can then be cloned and scaled up for production. 

The preferred prokaryotic host cell for use in expressing the gene 

25 sequences of the present invention is Salmonella. Other preferred host cells include 
strains of the bacteria R coli y although Bacillus, Shigella and other genera are also 
useful. Techniques for transforming these hosts and expressing foreign DNA sequences 
cloned therein are well known in the art (see, e.g., Maniatis et aL, Molecular Cloning: 
A Laboratory Manual, Cold Spring Harbor Laboratory, 1982, which is incorporated 

30 herein by reference; or Sambrook et aL, supra). Vectors used for expressing cloned 
DNA sequences in bacterial hosts will generally contain a selectable marker, such as a 
gene for antibiotic resistance, and. a promoter that functions in the host cell. 
Appropriate promoters include the trp (Nichols and Yanofsky, Meth EnzymoL 707:155- 
164, 1983), lac (Casadaban etal., 1 BacterioL 745:971-980, 1980), and phage X 

35 (Queen, J. Mol Appl Genet 2:1-10, 1983) promoter systems. Plasmids useful for 
transforming bacteria include the pUC plasmids (Messing, Metk EnzymoL 707:20-78, 
1983; Vieira and Messing, Gene 79:259-268, 1982), pBR322 (Bolivar et aL, Gene 2:95- 
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113, 1977), pCQV2 (Queen, ibid), and derivatives thereof. Plasmids may contain both 
viral and bacterial elements. 

Given the teachings provided herein, promoters, terminators and methods 
for introducing expression vectors encoding sefA, £, sefC, se/D, agfA, tctA, tctB, and/or 
5 tctC genes of the present invention into plant, avian, fish and insect cells would be 
evident to those of skill in the art. The use cf baculoviruses, for example, as vectors for 
expressing heterologous DNA sequences in insect cells has been reviewed by Atkinson 
et al. (Pestic. ScL 25:215-224,1990). In addition, the use of Agrobacterium rtnzogenes 
as vectors for expressing genes in plant cells has been reviewed by Sinkar et al. (J. 

10 BioscL (Bangalore) 77:47-58, 1987). 

Host cells containing vector constructs of the present invention are then 
cultured to express a DNA molecule as described above. The cells are cultured 
according to standard methods in a culture medium containing nutrients required for 
growth of the chosen host cells. A variety of suitable media are known in the art and 

15 generally include a carbon source, a nitrogen source, essential amino acids, vitamins and 
minerals, as well as other components, e.g., growth factors or serum, that may be 
required by the particular host cells. The growth medium will generally select for cells 
containing the DNA constructs) by, for example, drug selection or deficiency in an 
essential nutrient which is complemented by the selectable marker on the DNA construct 

20 or co-transfected with the DNA construct. 

Suitable growth conditions for yeast cells, for example, include culturing 
in a chemically defined medium, comprising a nitrogen source, which may be a non- 
amino acid nitrogen source or a yeast extract, inorganic salts, vitamins and essential 
amino add supplements at a temperature between 4°C and 37°C, with 30°C being 

25 particularly preferred. The pH of the medium is preferably maintained at a pH greater 
than 2 and less than 8, more preferably pH 5-6. Methods for maintaining a stable pH 
include buffering and constant pH control. Preferred agents for pH control include 
sodium hydroxide. Preferred buffering agents include succinic acid and Bis-Tris (Sigma 
Chemical Co., St Louis, Mo.). Due to the tendency of yeast host cells to 

30 hyperglycosylate heterologous proteins, it may be preferable to express the sefA, B, 
sefC, sefD, agfA, tctA, tctB, and/or tctC genes of the present invention in yeast cells 
having a defect in a gene required for asparagine-linked glycosylation. Such cells are 
preferably grown in a medium containing an osmotic stabilizer. A preferred osmotic 
stabilizer is sorbitol supplemented into the medium at a concentration between 0.1 M 

35 and 1.5 M, preferably at 0.5 M or 1.0 M. 

Cultured mammalian cells are generally cultured in commercially 
available serum-containing or serum-free media. Selection of a medium and growth 
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conditions appropriate for the particular cell line used is well within the level of ordinary 
skill in the art. 

ITT, Proteins 

5 

A. l>mtrins Generally 

As noted above, the present invention also provides isolated proteins. 
Within the context of the present invention, such proteins are understood to include the 
whole, or portions, of a gene product derived from one or more of the sefA, sefB, se/C, 

10 seJD. agfA, tctA, tctB, and/or tctC genes, or derivatives thereof as discussed above. 
Where the protein is a portion of a native gene or is encoded by derivative of a native 
gene, the protein maintains substantially the same biological activity of the native 
protein. The structure of the proteins corresponding to the sefA. sefB, sefC, seJD, agfA, 
tetA, tctB, and/or tctC genes can be predicted from the primary translation products 

15 using the nydrophobichy plot function of, for example, P/C Gene or Inteffigenetics Suite 
(Inteffigenetics, Mountain View, Calif), or according to the methods described by Kyte 
and Doolittle (J. Mol Biol 757:105-132, 1982). 

B. Purification of Proteins 

20 Proteins can be isolated by, among other methods, culturing 

suitable host and vector systems to produce the recombinant translation products of the 
present invention. Supernatant* from such cell lines, or protein inclusions or whole cells 
where the protein is not excreted into the supernatant, can then be treated by a variety of 
purification procedures in order to isolate the desired proteins. For example, the 

25 supernatant may be first concentrated using commercially available protein 
concentration filters, such as an Amicon or Millipore Pellicon ultrafiltration unit. 
Following concentration, the concentrate may be applied to a suitable purification matrix 
such as, for example, an anti-protein antibody bound to a suitable support. 
Alternatively, anion or cation exchange resins may be employed in order to purify the 

30 protein. As a farther alternative, one or more reverse-phase high performance liquid 
chromatography (RP-HPLC) steps may be employed to further purify the protein. 
Other methods of isolating the proteins of the present invention are well known in the 
skill of the art. 

A protein is deemed to be "isolated" within the context of the present 
35 invention if no other (undesired) protein is detected pursuant to SDS-PAGE analysis 
followed by coomassie blue staining. Within other embodiments, the desired protein can 
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be isolated such that no other (undesired) protein or LPS is detected pursuant to SDS- 
PAGE analysis followed by silver staining. 

TV. re positions Carwhte nf Elicitin g *" immune Response and Vaccines 

5 

A Peptide Compositions 

The present invention provides isolated compositions capable of eliciting 
an immune response, such as imrnunogens, comprising the amino acid sequences shown 
in Figures 2 to 7B, or substantial equivalents thereof. Various portions of such amino 
10 acid sequences can also be utilized within the context of the present invention. The 
immunogen may be selected from a portion of the amino acid sequences that is as small 
as 5 amino acids or as large as about 40 to 50 amino acids, but preferably the portion 
will be about 12 to 35 amino acids in length. In preferred embodiments, the hnmunogen 
comprises a GWPQ amino acid sequence or a seJD amino acid sequence. 
15 As will be understood by one of ordinary skill in the art, slight deviations 

of the amino acid sequences can be made without affecting the immunogenidty of the 
immunogen Substantial equivalents of the above proteins include conservative 
substitutions of amino acids that maintain substantially the same charge and 
hydrophobic^ as the original amino acid. Conservative substitutions include 
20 replacement of valine for isoleucine or leucine, and aspartic acid for glutamic acid, as 
well as other substitutions of a similar nature (See Dayhoffet al. (ed.), "Atlas of Proton 
Sequence and Structure," NatL Biomed Res. Fdn., 1978). 

Fimbriae, preferably one or more of SEF14, SEF17, SEF18 and SEF21, 
when expressed on whole cell (either viable whole cells or in bacterin form) or when 
25 presented in a purified form, are capable of eliciting an immune response in animals. In 
addition, the proteins can be used to generate either polyclonal or monoclonal 
antibodies. In a prefered embodiment, the isotype of the antibodies (preferably 
monoclonal antibodies) is IgA. Thus, the fimbriae can be an important bacterial cell 
component in generating host mucosal imrmmity to Salmonella, an enteric pathogen. 
30 As will be evident to one of ordinary skill in the art, the imrnunogens 

listed above, including their substantial equivalents, may stimulate different levels of 
response in different animals. The imrnunogens listed above, including their substantial 
equivalents, can be tested for effectiveness as a vaccine in experiments as described 
below in the Examples. These experiments include the T-cell proliferation assays, 
35 determination of lymphokine production after stimulation, and immunoprotection trials. 
Briefly T-cell proliferation assays can be utilized as an indicator of potential for cell 
mediated immunity. Additionally, evidence of lymphokine production after stimulation 
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by an immunogen can be utilized to determine the potential for protection provided by 
an immunogen. 

Finally, as described below, trials can be performed in order to determine 
the level of eUcitation of an immune response, including actual immunoprotection, in 
5 animals. In the case of humans however, instead of initial immunoprotection trials it is 
preferred to first screen peripheral blood lymphocytes (PBLs) from patients infected 
with Salmonella in the following manner. Briefly, PBLs can be isolated from diluted 
whole blood using Ficoll density gradient cenlrifugation and utilized in cell proliferation 
studies with 3 H -*ynudine as described below. Positive peptides are then selected and 

10 utilized in primate trials. 

The immunogens or peptides of the present invention can be readily 

produced utilizing many techniques well known in the art (see Sambrook et al., supra. 
Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory Press, 
1989). Particularly preferred is the synthesis of the immunogens utilizing conventional 
15 peptide synthesizers. 

B. Attenuated Bacterial Compositions 

1. fompostftion-s using attenuated Salmonella 

20 As discussed above, an aspect of the present invention is the induction of 

mutations in the tail, tcUI or tctfl operons, which are responsible for citrate transport in 
Salmonella. Because Salmonella containing such mutations are not able to scavenge for 
certain nutrients from a host celL the Salmonella is typically unable to proliferate within 
a host cell. However, a Salmonella having an inoperative tricarboxylic acid transport 

25 system is still capable of being taken into a host cell, giving an infected celL and is able 
to persist within the host cell for some time, thereby allowing the Salmonella to elicit a 
response from the immune system of the host animal. Such a Salmonella is known as an 

attenuated Salmonella. 

In an alternative embodiment, the Salmonella is attenuated via 

30 introducing a mutation into two or more of its fimbrial genes, sefA, se/D, agfA, and 
JhnA, wherein the mutation effectively prevents production of the selected fimbriae by 
the Salmonella. In preferred embodiments, the mutations are induced in tbefimA gene 
(SEF21) and the agfA gene (SEF17). In an alternative preferred embodiment, the 
mutations are induced in the^ (SEF21), agfA gene (SEF17) and sefD gene (SEF18). 

35 Further, where S. enteritidis is a concern, the group of mutations will preferably include 
a mutation in the sefA gene (SEF14). Alternatively, because SefA protein and SEF14 
fimbriae are known to induce a T-cell response, the group of mutations may also include 
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a mutation in sefA that permits production of SEF14 but disrupts any toxic effect that 
may be due to SEF14. Preferably, such a mutation in sefA is caused by inserting a 
foreign nucleic acid molecule in an appropriate portion of the sefA gene. 

Other mutations that provide an attenuated Salmonella are also known 
5 and, while not prefered, can be useful within some aspects of the present invention. 
Examples of such attenuating mutations include the following: galE mutants, which 
lack UDP galactose epimerase (such mutants also reversibly lack LPS and the Vi 
antigen) (Nnalue and Stacker, Microb. Path. 7:299-310, 1989; Gennanier and Rurer, J. 
Infect Dis. 757:553-558, 1975); aroA or baroA mutants, which are blocked in aromatic 

10 amino acid biosynthesis, as well as other aro genes including aroC and aroD (Hoiseth 
and Stacker, Nature 797:238-239, 1981; Dougan et al., Molec Gen. Genet. 207:402- 
405, 1987; Brown et al, J. Infect. Dis. 155X6-92, 1987); Jones et aL, Vaccine 929-34, 
1991; Mukkur etal., /. Med Microbiol. 34:57-62, 1991; Robertsson etal. Infect 
Immun. 41:142-150, 1983; Cooper et aL, Microb. Pathog. 9:255-265, 1990); AcyaAcrp 

15 combined deletion mutants, which lack adenylate cyclase and cAMP receptor protein 
(Curtiss and Kelly, Infect. Immun 5:3035, 1987); tSphoP mutants, which lack the 
regulatory gene involved in acid phosphatase production: (Galan and Curtiss. Microb. 
Path. 6:433, 1989; Miller et al. Res. Microbiol. 141:817, 1990); purA or /spur purine 
biosynthesis mutations, which are typically not useful alone, but rather when used in 

20 combination with aroA and galE (Nnalue and Stacker, Infect Immun. 55:955, 1987; 
Sigward etal., Infect Immun. 57:1858-1861, 1989); Acdt mutants, which have a 
mutation of the chromosome that blocks Salmonella from colonizing deep tissue 
(Curtiss etal., Vet Microbiol 37:397-405, 1993); phoP* mutants, which are 
constitutive producers of phoP (Miller and Mekalanos, J. Bacteriol. 772:2485-2490, 

25 1990.); streptomycin independent reverse mutants (Pardon, Res. Microbiol. 141:945- 
953, 1990); attachment/invasion deficient mutants of S. choleraesuis (Wilson et al., Res. 
Microbiol. 747:827-830, 1990); temperature-sensitive S. enteritidis mutants (Onozuka 
et al., Int J. Immunopharm. 77:781-787, 1989); his mutants in combination with haroA 
and b>pur mutants (Edwards and Stacker., J. Bacteriol. 770:3991-3997, 1989); Asp", 

30 Hst, Rbt, Rtt (asp arapme auxotroph, high sensitivity to tensides, reversion to bile 
tolerance, reversion to tenside tolerance) mutants, which have various mutations that are 
useful to generate strains with graded attenuation: (Linde et al., Vaccine 5:278-282, 
1990); htrA mutants, which lack a stress protein serine protease: (Strahan et al., Microb. 
Path. 72:31 1-317, 1992); and, mutants that lack the virulence plasmid (Nakamura et al., 

35 Infect. Immun. 50:586, 1985; Barrow, Infect. Immun. 58:2283, 1990) 
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In preferred embodiments, the host animal is a fish or a warm-blooded 
animal, and further preferably is a food animal such as poultry, swine or cattle, or even 
further preferably a human being. 

In a preferred embodiment, the attenuated Salmonella have been 
5 engineered to express a foreign gene (including portions thereof) fused to one or more 
genes that code for the fimbrin proteins SefA,, SefD or AgfA. In a preferred 
embodiment, the fusion protein comprises a portion of the SEF17 fimbrin, which is 
encoded by the agfA gene operably linked in open reading frame to the gene for the 
foreign antigen. In an alternative preferred embodiment, the fusion protein comprises a 
10 portion of the SEF14 fimbrin, encoded by the sefA gene, the SEF18 fimbrin, encoded by 
the sefD gene, or the SEF21 fimbrin, encoded by the fimA gene. By "foreign antigen" it 
is meant an antigen foreign to the host cell and not typically expressed in a Salmonella 
fimbriae. Preferably, the foreign antigen is foreign to both the host cell and the 
Salmonella. The "foreign antigen" includes a whole antigen, as well as a portion of an 
15 antigen, comprising numerous epitopes, and also includes an antigen, or portion of an 
antigen, that is a single epitope. 

In a preferred embodiment, the attenuated Salmonella composition, 
including an attenuated Salmonella composition comprising a foreign antigen, is an LPS 
O-polysaccharide deficient strain of Salmonella able to express SEF18 or SEF17 
20 fimbriae (which include the foreign antigen, where such is present). In alternative 
embodiments, the fusion proteins are not expressed as fimbriae, but rather as protein 
aggregates, inclusion bodies or stable polymers of AgfA, SefA, SefD, SefC, Se£B, TctC, 
TctB or TctA. In a further alternative embodiment, the fusion protein comprises more 
than one foreign antigen. 

25 

2. Compositions using attenuated K cou 
In an alternative embodiment, the attenuated bacteria can be an 
attenuated E. coli able to express one or more Salmonella sefA, sefD, sefC, sefB, agfA, 
tctC, tctB or tctA fimbriae genes. In a preferred embodiment, the expressed gene is the 
30 agfA gene. In alternative preferred embodiments, the expressed gene is the sefA gene, 
the sefD gene, or the fimA gene. In an alternative embodiment, the peptide 
corresponding to the desired gene is not displayed as an intact fimbriae, but is still able 
to elicit an immune response from the host organism. In preferred embodiments, the 
host animal is a fish or a warm-blooded animal, and further preferably is a food animal 
35 such as poultry, swine or cattle. Further preferably, the host animal is a human being. 

In a preferred embodiment, the E. coli or other bacterial host cell 
expresses a Salmonella AgfA, FimA, Se^, SefD, SefC, SefB, TctC, TctB or TctA 
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protein fused to one or more foreign antigens. The Salmonella protein is further 
preferably a fimbrin protein, further preferably SefA, AgfA, SefD or FimA. Further, 
preferably both the Salmonella-based antigen and the foreign antigen are able to elicit a 
response from the immune system of the host animal, yielding a multipurpose 
S composition/immunogen. 

C. Nucleic Acid Vaccines 
1. Generally 

Direct injection, or other appropriate introduction, of one or more of the 

10 se/U2f sefUj, sefA, seJB, sejC, seJD tctC, tctB, tctA, or agfA genes into an animal can 
elicit an immune response in the animal, and preferably vaccinate, against the peptide 
that is expressed from the given gene, and therefore Salmonella. In one embodiment, 
the nucleic acid that is injected further comprises an antigen from a foreign microbe, 
thereby providing a composition able to elicit an immune response against microbes in 

15 addition to Salmonella. 

In an example of this procedure, naked DNA is introduced into an 
appropriate cell, such as a muscle cell, where it produces protein that is then displayed 
on the surface of the cell, thereby eliciting a response from host cytotoxic T- 
lymphocytes (CTLs). This provides an advantage over traditional immunogens wherein 

20 the elicited response comprises specific antibodies. Specific antibodies are generally 
strain-specific and cannot recognize the corresponding antigen on a different strain. 
CTLs, on the other hand, are specific for conserved antigens and can respond to 
different strains expressing a corresponding antigen ("Heterologous protection against 
influenza by injection of DNA encoding a viral protein," Ulmer et al., Science 259: 1745- 

25 1749 (1993); "Expression of recombinant genes in myocardium in vivo after direct 
injection of DNA," Lin et al., Circulation 52:2217-21 (1990); "Long-term persistence of 
plasma DNA and foreign gene expression in mouse muscle," Wolff et al., Human MoL 
Gen. 7:363-69 (1992)). 

In preferred embodiments, the vector construct containing one or more 
30 of the gene sequences of the present invention is naked and is injected into a muscle of 
an animal, resulting in the uptake of the vector construct by the muscle cells of the 
animal, and expression of the protein encoded by the DNA. In preferred embodiments, 
the vector construct is injected into fish or warm-blooded animals, including birds, mice, 
rats, primates and human beings. Further, it is preferred that the vector construct is a 
35 DNA expression vector wherein the desired gene is under the control of a Rous sarcoma 
virus (RSV) or cytomegalo vims (CMV) promoter (Ulmer et al., supra, Lin et al.). 
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Further preferably, the plasmid resists incorporation into the chromosomal DNA of the 
animal's cell, and does not replicate within the animal's cell (Wolff et at, supra). 

Upon introduction of the naked vector construct into the animal's cell, 
the construct is then able to express the gene that it carries, which gene preferably 
5 comprises one (or more) of the sefUj, sefU^ sefA t sefB, sefC, sefD, tctC, tctB, tctA, or 
agfA genes. Accordingly, upon expression of the desired peptide, an immune response 
is elicited from the host animal. Preferably, the immune response includes CD8 + CTLs 
able to respond to different strains that exhibit a form of the desired peptide. 

In another preferred embodiment, one of the desired proteins discussed 
10 above is operably fused in an open reading frame to a second nucleotide sequence (or 
more) that codes for a foreign antigen, to yield a single dicistronic protein exhibiting 
antigenic characteristics of both Salmonella and another, foreign organism. 

D. Administration of Compositions Able to Elicit an Immune Response 

15 1. Peptide Compositions 

The present invention provides methods for simulating an immune 
response in warm-blooded animals comprising administering an effective amount of a 
pharmaceutical composition comprising an immunogen and a physiologically acceptable 
carrier or diluent. For purposes of the present invention, warm-blooded animals include, 

20 among others, humans, primates, dogs, cats, pigs, sheep, horses, rats, mice, chickens, 
turkeys and other food animals. The methods can also be used for stimulating an 
immune response in cold-blooded animals, preferably food animals such as fish. 

Many suitable carriers or diluents can be utilized in the present invention, 
including among others saline, buffered saline, and saline mixed with nonspecific serum 

25 albumin. The pharmaceutical composition may also contain other excipient ingredients, 
including adjuvants, buffers, antioxidants, carbohydrates such as glucose, sucrose, or 
dextrins, and chelating agents such as EDTA. Within a particularly preferred 
embodiment, an adjuvant is utilized along with the immunogen. Examples of such 
adjuvants include alum or aluminum hydroxide for humans. 

30 The amount and frequency of administration can be determined in clinical 

trials, and will depend upon such factors as the Salmonella species against which it is 
desired to protect, the particular immunogen used, the degree of protection required, 
and many other factors. In a preferred embodiment, the composition is administered 
orally, and the attenuated Salmonella are taken up by cells, such as cells located in the 

35 lumen of the gut. Alternatively, the composition can be parenterally administrated via 
the subcutaneous route, or via other routes. Depending upon the application, quantities 
of injected immunogen will vary from 50 |ig to several milligrams in an adjuvant vehicle 
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and preferably about 100 \ig to 1 mg, in combination with a physiologically acceptable 
carrier or diluent. Booster immunizations can be given from 4-6 weeks later. 

2. Attenuated Bacterial Compositions 
5 Attenuated bacterial compositions are preferably administered orally in 

combination with a physiologically acceptable carrier or diluent. However, attenuated 
bacterial compositions can also be administered via the same mechanisms as peptide- 
based compositions, such as via injection in combination with an adjuvant. 

The following Examples are offered by way of illustration, and not by 
10 way of limitation. 

EXAMPLES 

15 EXAMPLE 1 

Cloning of the azfA gene of Salmonella enteritidis 

An AgfA-negative TnphoA insertion mutant of S. enteritidis 27655-3b, 
named strain 2-7£ was constructed (Coliinson et aL, "Thin, aggregative fimbriae mediate 

20 binding of Salmonella enteritidis to fibronectin," J. BacterioL 775:12-18, 1993). The 
strain contains an agfA-TnphoA gene fusion. Stock cultures of strain 2-7f were 
prepared using cells from mid-exponential phase cultures that were mixed with 7% 
glycerol and then stored at -80°C in Luria-Bertani (LB) broth. Isolated colonies of 
strain 2-7f were obtained by inoculation of solid LB medium followed by incubation at 

25 37°C for 24 hours. An individual colony was inoculated in 2 ml of LB broth contained 
in a sterile 18 mm x 150 mm test tube and the inoculation was incubated for 48 hours at 
37°C under static conditions. For the purpose of isolation of DNA for amplification of 
agfA, cells may be equally effectively prepared by growth in LB broth, Colonization 
Factor Antigen (CFA) broth, T (tryptone>medium, other suitable proteolytic digest- 

30 based medium, or other medium suitable to support the growth of Salmonella. Cells 
may also be grown under aeration, such as by growth of the culture in an Erlenmeyer or 
other flask positioned on a rotary or gyratory shaking device. Preferably, the culture is 
grown at temperatures between 20°C and 37°C. 

Cells of strain 2-7f composing the pellicle at the surface of a static culture 

35 and cells in suspension were mixed by vortexing for 1 minute. Cells were harvested 
from 1 ml of this cell suspension (approximately 20 mg wet weight of cells) transferred 
to a 1.5 ml polypropylene microfuge tube and centrifuged (16,000 x g for 5 min. at 21° 
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C). The pelleted cells were saved and subsequently resuspended in 1 ml f distilled 
water. The cells were lysed to release cellular DNA by boiling the 1 ml sample in a 
sealed microfage tube for 5 minutes. The cell lysate was partially clarified by 
centrifugation (16,000 x g for 10 min. at 4°C) in a microfuge to pellet cell debris. The 
5 crude DNA preparation (supernatant fraction) was used as a substrate for amplification 
of a 394 bp DNA fragment encoding the majority of the SEF17 fimbrin subunit, agfA. 

Standard molecular cloning techniques were performed according to 
protocols described by Sambrook et al. {Molecular Cloning: A Laboratory Manual, 
Cold Spring Harbor Press, Cold Spring Harbor, New York, 1989), or by incorporating 

10 minor modifications thereto that are well known in the ait. To facilitate cloning of an 
amplified agfA fragment, PCR primers were constructed to include restriction 
endonuclease cleavage sites. The 5* PCR primer, called TAF1, was a 34 bp biased, 
degenerate oligonucleotide dGGCGGAAGCTTGAATTCGTrA/C/TIGTrA/C/TIC 
C[A/G/T]CA[A/G]TGGGG) (SEQ ID No. ), of which the 17 bases at the 3* end 

15 correspond to amino acid residues 2 to 7 of the AgfAN-terminus (the final nucleotide of 
amino acid 7 is degenerate, and therefore was not made a part of the primer). The 
amino acid sequence of the N-terminus had been previously determined by Collinson 
et al. (supra). The underlined sequences were required to create HintiKL and EcoRl 
cleavage sites in the product of DNA amplification. The 3' PCR primer, called TAF2, 

20 (dGGGAAAGGTTGAATTCAGGACGCTACTTGTG) (SEQ ID No. ), into which 

three nucleotide changes (underlined) were introduced to create an EcoRI ate in the 
PCR product, was complementary to the IS50L sequence residing at the junction of 
TnphoA generated alkaline phosphatase gene fusions. The amplified 394 bp agfA 
fragment was isolated after agarose (1.5%) gel electrophoresis using a 40 mM Tris- 

25 acetate, 1 mM EDTA buffer system and then purified using Gene Clean n glassmilk 
following the 'double Gene Clean 1 protocol recommended by the manufacturer (Bio 101 
Inc.,LaJolla,CA). 

Approximately 0.5 jig of the amplified agfA fragment was cleaved with 
the restriction endonuclease EcoRI at a concentration of 5 units per \ig of DNA. In 

30 addition, 0.5 \ig of the plasmid vector pUC19 (Yannisch-Perron et al., "Improved M13 
phage cloning vectors and host strains: nucleotide sequences of the M13mpl8 and 
pUC19 vectors," Gene 35:103-119, 1985) was similarly digested with EcoRl. After 
digestion, the DNA samples were extracted once with 0.75 volume of buffered-phenol 
and then with 0.75 volume of chloroform to separate enzyme from DNA. The digested 

35 DNA fragments were precipitated in the presence of 2.5 volumes of 95% ethanol and 
0.3 M sodium acetate, pH 5.4. Residual ethanol was removed from the DNA pellets 
under vacuum. 
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The DNA was dissolved in 20 *il of 10 mM TrisHCl - 1 mM EDTA, pH 
8.0. Ligation of agfA DNA fragments into pUC19 was conducted in 50 ^ ligation 
buffer using 7 units of T4 DNA ligase. Clones containing recombinant plasmids were 
obtained by transformation of competent cells of Escherichia coli strain DH5a obtained 
5 from GIBCO BRL Life Technologies Inc. (Burlington, Ontario, Canada) with a sample 
of ligated DNA. Transformed cells were selected on solid LB medium containing (200 
^ig/ml) ampicillin, 50 \M IPTG (isopropyl-P-D-thiogalactopyranoside) and 0.005% X- 
GAL (5-bromo-4-chloro-3-indoyl-p-D-galactoside). Ampicillin resistant colonies that 
contained the recombinant plasmids were identified by their white color. Recombinant 

10 plasmid DNA was purified from transformed cells using a modification of the standard, 
small scale, alkaline lysis technique described by Sambrook et al. {supra). Recombinant 
plasmids were purified from three individually isolated colonies. These three, apparently 
identical, plasmids were designated pAGFl, pAGF3 and pAGF4. 

In order to obtain a clone containing the full agfA gene, chromosomal 

15 DNA of S. enteritidis strain 27655-3b was purified by CsCl gradient centrifugation and 
digested separately with Hindm or Dral and analyzed by Southern hybridization at 
65°C using a random-primer, [a- 32 P]dATP labeled agfA PGR fragment according to 
methods described by Sambrook et al. (supra). FoDowing stringent filter washing at 60° 
C to 62°C as described above, a genomic fragment of approximately 3 kb was identified. 

20 Hindm or Dral digested strain 3b DNA was fractionated according to size by sucrose 
gradient centrifugation (Sambrook et al., supra). DNA fragments contained in the 
fraction found to hybridize to the [ 32 P]-labeled agfA PCR fragment were ligated into 
M13mpl8 (Yannisch-Perron et al., supra) at the Hindm site or Smal site within the 
multiple cloning site at 12°C using 12 units of ligase and a total of 1 ng of DNA at a 

25 final concentration of 50 ng/|il. Recombinant plaques prepared on a lawn of E. coli 
JM109 (Yannisch-Perron et al., supra), containing agfA were identified by dot blot 
hybridization (Sambrook et al., supra) using the [ 32 P]-labeled agfA PCR fragment. The 
recombinant M13mpl8 bacteriophage containing agfA were plaque purified in the 
replicative form, double-stranded DNA was purified and the insert fragments encoding 

30 agfA were cloned into pUC18 (Yannisch-Perron et al., supra), and transformed into 
K coli strain DH5ct as described by Sambrook et al. (supra). The recombinant plasmid 
composed of pUC18 and the approximately 3kb Dral fragment of strain 3b DNA was 
named pDAG6. The recombinant plasmid formed from pUC18 and the approximately 
3kb HindUl fragment of strain 3b genomic DNA was named pHAGlO. 

35 

.J 
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EXAMPLE 2 
Determination of the DNA sequence of ag fA 

The DNA sequences of both strands of the agfA PCR fragment of 
5 Example 1 were determined by a modification of the enzymatic dideoxy-termination 
sequencing method (Sanger et al., "DNA sequencing with chain-termination inhibitors," 
Proc. Natl Acad ScL USA 7*5463-5467, 1977) using the primers TAF1 and TAF2 
and the reagents and protocols supplied in the Sequenase™ Version 2.0 kit (United 
States Biochemicals, Cleveland, Ohio). The sequence was confirmed by determining the 

10 DNA sequence of agf A contained in two of the independent plasmid clones. For this 
determination, an Applied Biosystems model 373A automated DNA sequencer and 
associated reagents, protocols. and software (version 1.10) for thermal cycle sequencing 
(Applied Biosystems Canada Inc., Mississauga, ON) was used. The universal forward 
and reverse sequencing primers (Yannisch-Perron et al., supra) were used. The DNA 

15 sequence of agfA encoded on the overlapping HindlH and Dral fragments cloned in 
pHAGlO and pDAG6 was determined on both strands using double-strand DNA 
sequencing techniques. 

Computer analysis of the DNA sequence for open reading frames and 
gene translation was performed using DNA Strider, version 1.1. The start of the open 

20 reading frame of the agfA gene encoding the mature AgfA fimbrin was recognized by 
comparison of the translated sequence to the N-terminal amino acid sequence of AgfA 
The agfA DNA sequence is presented in Figure 7B. The translated DNA sequence of 
the single open reading frame corresponded precisely to amino acid residues 2 to 3 1 
determined by N-terminal sequencing of AgfA The amino acid composition of the 

25 translated sequence had a similar high glycine content (16%), high combined alanine, 
serine plus glycine content (37%), low basic amino acid content (4.5%X and nearly 30% 
asparagine plus aspartic acid content consistent with the total amino acid analysis of 
native AgfA fimbrin (8). Based on the estimated molecular weight of AgfA of 17 KM r , 
and the expected molecular weight of the protein encoded in the 333 bp region ofagfA, 

30 about 12 K- 13 K A/ r , it appears that about three quarters of SEF17 fimbrin is 
represented in the fragment. 
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EXAMPLE 3 
Production and expression of assembled, 
enzvmaticallv active AgfA-enzvme fusion proteins 
in Salmonella enteritidis 

5 

Salmonella enteritidis produce novel thin aggregative fimbriae, known as 
SEF17, that comprise a class of stable protein polymers requiring treatment with 90% 
formic acid to depolymerize the fimbriae into their composite fimbrin subunits 
(Collinson, supra). SEF17-negative TvphoA mutants were isolated that were defective 

10 in the production of SEF17 fimbriae due to transposon mutagenesis resulting in the 
fusion of the SEF17 fimbrin gene, agfA, with the promoteriess alkaline phosphatase 
gene, phoA (Collinson et al., "Thin, aggregative fimbriae mediate binding of Salmonella 
enteritidis to fibronectin," 7. Bacteriol 775:12-18, 1993). Western blot data indicated 
that the AgfA-PhoA fiision protein expressed by the IvphoA mutant 2-7f either 

15 aggregates or self assembles into polymers because formic acid treatment was required 
to disassemble these polymers into the AgfA-PhoA subunits. 

EXAMPLE 4 
Sequencing of the tctCBA gene cluster 
20 of Salmonella tvphimurium 

The tricarboxylic acid transport (tctl) operon of Salmonella typhimurium 
LT2 was isolated on an 8 kb EcoRI-BamHl fragment cloned into the vector pBR322 to 
create the recombinant plasraid pKWIOl which conferred a citrate-utilizing (c/Y+) 

25 phenotype on E. coli MC4100 (Widenhorn et al., "Cloning and promoters of the 
Salmonella typhimurium tricarboxylate transport operon in Escherichia coli" 
J. Bacteriol. 770:883-888, 1988). pKWIOl was digested with the restriction enzymes 
Kpnl and Pstl to yield a fragment of 4.5 kb, containing the entire tctl operon, which was 
inserted into the M13 cloning vector M13mpl8 (Yannisch-Perron etal., supra) to 

30 produce the recombinant clone KS1016. KS1016 was digested with the restriction 
enzymes £coRI and Hindm (which cut only in the vector DNA) to release the tctl 
fragment for subcloning into the Ml 3 cloning vector MWB2349 to produce MKS3. 
(Barnes et al., "Kilo-Sequencing: Creation of an Ordered Nest of Asymmetric Deletions 
Across a Large Target Sequence Carried on Phage M13," Meth. Em. 707:98-122, 

35 1983.) 

To facilitate determining the DNA sequence of tctCBA, the 4.5 kb 
£coW-i/fndin tctl fragment was subcloned from KS1016 into the M13 cl ning vector 
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MWB2341, whose multiple cloning she is oriented in the opposite direction relative to 
the binding site of the universal DNA sequencing primer. This resulted in the creation of 
MKS11. 

The recombinant clones MKS3 and MKSll were subjected to a 
5 procedure generating nested deletions pale et al., "A rapid single-stranded cloning 
strategy for producing a sequential series of overlapping clones for use in DNA 
sequencing: Application to sequencing the corn mitochondrial 18S rDNA," Plasmid 
75:31-40, 1985). Briefly, angle-stranded recombinant M13 DNA was isolated by 
standard procedures (Sambrook et al., supra). DNA was linearized by Hindffl digestion 
10 following hybridization of the oligonucleotide WK-10, S'-TGAATTAATTCCA 

CAAGCTTTTTTTTTTTO^ to MKS3 (SEQ ID No. ) or WK-9, 5'- 

CGACGGCCAGTGCCAAGCTTTITITTTT-3 f to MKSll (SEQ ID No. ) to 

create a double stranded restriction endonuclease cleavage site in an otherwise single- 
stranded molecule. 

15 The linearized DNA was subsequently degraded by the 3' to 5* 

exonuclease activity of T4 DNA polymerase. Samples were withdrawn at 15, 30, 45, 
60, 75, 90 and 105 minutes. The polymerase was inactivated by treating the sample at 
65°C for 2 minutes and the samples stored on ice. By withdrawing samples at different 
times, populations of DNA fragments with differing degrees of deletion were created. 

20 All time-point samples were pooled and the DNA was treated with terminal transferase 
in the presence of ATP to create a polyA tail. The deleted DNA molecules were 
hybridized with oligonucleotide WK-10 by heating to 65°C for 5 minutes, followed by 
slow cooling for 30 minutes, then ligated by T4 DNA ligase for 1 hour at 21°C and 
overnight at 4°C. The ligated fragments were transformed into K coli WB373 made 

25 competent by the procedure of Mandel (Mandel and Higa, "Calcium dependent 
bacteriophage DNA infection," J MoL BioL 53:159, 1970). Well-isolated M13 plaques 
were picked, and grown up overnight (Sambrook et al., supra) into 2X-YT broth. The 
size of each recombinant DNA molecule was estimated by agarose gel electrophoresis. 
From over 100 plaques analyzed, 35 recombinant phage representing deletions of 0.3 - 

30 4.8 kb were selected for DNA sequence analysis. 

DNA sequences were determined using standard modifications of the 
enzymatic dideoxy termination method of Sanger et al. (supra). To resolve the 
sequence of regions that proved difficult to determine by standard protocols, the 
sequencing reactions were performed using Sequenase (a chemically modified form of 

35 T7 DNA polymerase; US Biochemicals) and either deoxyinosine or 7- 
deazadeoxyguanosine in place of deoxyguanosine or standard protocol sequencing 
reactions were analyzed by wedge-gel electrophoresis. Whereas, most sequencing 
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reactions utilized the universal forward sequencing primer, specific internal primers were 
also used. The names and sequences of the internal primers used are: WWK-19 5'- 
GGGCGACTATCGCGTTA-3', WWK-20 5'-AGCCACTCGTAGCGGCC-3', WWK-21 
S'-GGAAGTGCATTTTACGTO', WWK-22 S'-CATGCTGCCAAGACAGG-S', WWK- 
5 23 S'-CTTTGGATCTGCCAGGC-S', WWK-24 5'-GCGCCGTCATGATCGCC-3' 

(SEQ ED Nos. , , , , , and , respectively). The sequences for tctA, 

tctB and tctC are shown in Figures 4A-4B, 5 and 6A-6B, respectively. 

The sequences of tctA, tctB, and tctC were confirmed by automated 
DNA sequencing using an Applied Biosystems, Inc. Model 3 73 A automated DNA 
10 sequencer and the reagents and protocols provided by the manufacturer for cycle- 
sequencing (Applied Biosystems, Inc., Foster City, CA). DNA oligonucleotide 
sequencing primers used for this purpose included: 

S'TCGGGATGCTGTTCGGCG 3 ' (SEQ. ID. No. ) 

15 5 'CTGCCTGCGGAGTCGGC 3 ' (SEQ. ID. No. ) 

5'GTCGCAAGGCCAAGACCG 3 ' (SEQ. ID. No. ) 

s'gtgtatcggcaccaccctg 3 ' (seq. id. no. ) 

s'cccggcgatgttcaccg 3 ' (seq. id. no. ) 

s'ccaataccgcgccggag 3 ' (seq. id. no. ) 

20 s'gcggaggcaatgatgagcg^seq.id.no. ) 

5'tgccgccatactcacagcc 3 ' (seq. id. no. ) 

5'tCTTGGCAGCATGATGGCG 3 ' (SEQ. ID. No. ) 

5'CTGGCAATGGTCGCCCG 3 ' (SEQ. ID No. ) 

5 'GCAATCAGCAGCGCAGC 3 ' (SEQ. ID. No. ) 

25 

A restriction map prepared from the DNA sequence appears in Figure 9. 
The Figure shows the positions of endonuclease cleavage sites for April, Avail, BglEL, 
BglL, Bstll, Dral, EcoKV, Kpn\ Nco\ Sail, Smal, SndBl, Sspl and Xmnl relative to the 
open reading frames of tctC, tctB and tctC. 
30 DNA sequences encoding TctC were recognized by comparison to the 

complete sequence of TctC protein; the predicted amino acid sequence agreed with the 
amino acid sequence determined by peptide sequencing studies on the purified protein. 
The predicted size of the TctB protein based on the nucleotide sequence agreed with the 
size of an expressed protein as determined by SDS-PAGE. 
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EXAMPLES 
Salmonella strains containing mutations in 
tricarfaoxvlate transport (tct) operons 

5 A. Construction of tctl mutants in S. enteritidis 

DNA encoding the Salmonella typhimurium tctl operon was cloned into 
pKW108 ("Expression of the divergent tricarboj^late transport operon (tctl) of 
Salmonella typhimurium, n J. Bacteriol 770:3223-3227, 1988). The DNA was cleaved 
with the restriction enzyme BglH, which cleaves the operon at a single ate in the tctC 

10 gene. The linearized DNA was then digested with exonuclease III and nuclease SI. 
Samples were withdrawn at selected time points routine in the art and the enzymatic 
activity was stopped by the addition of EDTA. After all of the samples had been 
removed, the DNA was pooled, purified, then treated with the Klenow fragment of 
K coli DNA polymerase I. The DNA was ligated with T4 ligase to give vector 

15 constructs that were then transformed into K coli DH5a (Sambrook et al., supra). 

The resultant transformants were grown up in L-broth supplemented with 
100 |ig/ml amptcillin overnight and a plasrnid preparation was performed by the alkaline 
lysis procedure (Sambrook et al., supra). The population of plasmids was sized on an 
agarose gel, and shown to consist of molecules of heterogeneous sizes. The plasmid 

20 DNA was pooled, then linearized with EcoRl and used to electrotransfonn S. 
enteritidis. The transformants were grown on solid modified Davis minimal medium 
containing 50 jig/ml fluorodtrate (Fc), to select for transformants that had integrated an 
inactive tctl operon into the cellular chromosome and thus were resistant to the toxic 
citrate analog fluorodtrate. 

25 Recombinants were screened by colony lifts followed by hybridization to 

probe complementary to the region that had been deleted. Colonies that hybridized 
poorly to the probe were picked and grown in L-broth overnight. The tctC gene was 
characterized by PCR using oligonucleotides complementary to the DNA sequence of 
nucleotides 1459-1481 and 1955-1977 of the tctl operon (see Figures 6A-6B). One of 

30 the transformants tested showed an internal deletion in the tctC gene of approximately 
300 nucleotides between 1459 and 1977. This bacteria was named S. enteritidis strain 
SL54. 

B. Construction of tctll' strain 
35 Constitutive "on" mutants of the tctll operon in wild-type Salmonella 

typhimurium SU453 were isolated by plating out cells on Modified Davis Minimal 
Medium containing 93.4 mM Na + , with cis-aconitate as the carbon source and no K + 
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ions (Na-cac medium). The resultant tctll° n mutants were named KS823-832. KS823 
was used as a recipient strain for P22 int3 HT12/4 transducing phage (B. Ely etal., 
"Some improved methods in P22 transduction, " Genetics 76:625-631, 1974) that had 
been grown on an S. typhimurium strain carrying a tctl operon that had been 
5 insertionally activated by TnlO, which carries a tetracycline resistance gene (N. 
Kleckner, "Transposon TnlO Mobile genetic elements," pp. 261-298, Academic Press, 
N.Y., 1983). Bacteria that were tctl' and tctlP n were selected for by growth on Na-cac 
medium in the presence of tetracycline. The tetracycline resistant strains resulting from 
this cross were called KS838-847. 

10 KS838 was determined to be sensitive to fluorocitrate (Fc). KS838 was 

grown in the presence of Fc on Na-cac medium, using the disk radial streak assay 
(Somers et aL, "Fluorocitrate-resistant tricarboxylate transport mutants of Salmonella 
typhimurium, 91 Mol Gen, Genet. 757:338-345), and mutants resistant to Fc (Fc R ) were 
collected. These resistant strains were considered tctfr, and were named KS858-867. 

15 Next, the tetracycline resistance of the mobile genetic element TnlO was removed by 
making KS858 a recipient strain for P22 transducing phage that had been grown on 
wild-type Salmonella SU453. These phage replace the tctlAocas with the wild-type 
locus. tctT*~ isolates were selected by growing on Davis Modified Minimal Medium 
supplemented with 20 mM isocitrate as a carbon source. The resultant strains were 

20 called KS950-959 and KS970-979, and are phenotypically tctl+, tctlt, tctlll*. These 
strains contain no foreign DNA or introduced antibiotic resistance genes. 

C. Construction of tctlll' mutants 

Salmonella typhimurium SU453 strains were used, through TnlO 
25 insertion, to produce a series of tctlll' isolates named 5. typhimurium KS1170 and 
KS1 169. The TnlO elements can be induced to excise through culturing the bacteria on 
medium containing fiisaric acid (S. Maloy et aL, "Selection for loss of tetracycline 
resistance by Escherichia coli" J. Bacteriol 140:291-300, 1981). Several strains will 
suffer a deletion of tctlll flanking sequences concurrent with imprecise TnlO excision 
30 and then will be useful as attenuated strains. 

EXAMPLE 6 
Sequencing and characterization of sefU {Ui ABCD 
from the sef operon of Salmonella enteritidis 

35 

A 5.3 kb HindUl fragment carrying the SEF14 fimbrin structural gene 
and part of the sef operon was subcloned from cos48> a recombinant cosmid carrying 44 
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Icb of 5. ententidis chromosomal DNA, into pTZ19R to generate pKXl (Feutrier et al., 
"Cloning and expression of & Salmonella ententidis fimbrin gene in Escherichia co//," J. 
BacterioL 770:4216-4222, 1988; Mttller etal., "Fimbriation genes of Salmonella 
ententidis," 7. BacterioL 777:4648-4654, 1989). A series of overlapping deletion 
5 subclones of pKXl (ApKXl) were generated using pKXl linearized with EcdRI and 
varying degrees of DNAse I digestion according to the method of Lin et al. ("An 
improved DNA sequencing strategy," Anal. Biochem. 747:114-119, 1985), to create a 
collection of 50 nested deletions. The resulting linear ApKXl subclones were treated 
with the Klenow fragment of DNA polymerase I and then blunt end ligated with T4 

10 DNA ligase to recircularize the plasmids. The various ApKXl were transformed into E. 
coli XL-1 Blue (Stratagene, La Jolla, CA) using standard procedures (Sambrook et al., 
supra). The ApKXl were purified by standard alkaline lysis procedures (Sambrook 
et al., supra) and run on a 1% agarose gel. A series of ApKXl subclones separated by 
about 200 to 400 bp in size were chosen and named deLAlO, delBIS, deIB23, delCl, 

15 delD5, delD8, deID9, delD16, delD19, delEl, and delE21. Large amounts of these 
plasmids were purified by alkaline lysis for DNA sequencing. 

The ApKXl subclones were sequenced by the dideoxynucleotide chain 
termination method (Sanger et al., "DNA sequencing with chain terminating inhibitors," 
Proc. NatL Acad ScL USA 74:5463-5467, 1977) using T7 DNA polymerase (T7 DNA 

20 Polymerase Sequencing Core System, Deaza, Promega, Madison, WI) and 
deoxyadenosine 5-[<x-35S] triphosphate, (New England Nuclear, Markham, ON) 
according to the manufacturers' specifications. 

The result of DNA sequencing reactions were electrophoresed through a 
6% polyacrylamide gel (45 W, 55-60°C) using a discontinuous buffer system (Carninci 

25 et al., n A simple discontinuous buffer system for increased resolution and speed in gel 
electrophoretic analysis of DNA sequence," Nucleic Acid Res. 75:204, 1989). 
Following electrophoresis, the gels were fixed in a solution of 12% methanol and 10% 
acetic acid for 15 min, and dried onto 3MM paper (Whatman Intl. Ltd., Maidstone, 
England) under vacuum at 80°C for 2 hours on a Savant gel drying apparatus (Savant 

30 Instruments Inc., Farmingdale, NY). Dried gels were exposed to X-Omat K XK-1 film 
(Kodak, Rochester, NY) and the sequence read directly from the developed films. Both 
DNA strands were fully sequenced, using the 17 bp reverse primer for the coding strand 

(GTCATAGCTGTTTCCCG) (Sequence ID No. ) and 12 custom made internal 

oligonucleotide primers (ULTRA Diagnostics Corporation, Seattle, WA) for the 

35 opposite strand. To complete the sequence for the seJD gene, and to sequence the 
se/Uj and sefU2 S enes > a further 10 kb Kpnl fragment was obtained from cos48 and 
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subjected to sequence analysis generally as above. The sefA, sejB, sefC and sefD 
sequences are depicted in Figures 2 A-2D. 

The genes for sejUj and sefUi were similarly sequenced on overlapping 
subclones and these sequences are depicted in Figures 3A-3B. 
5 The programs contained in MacVector (Intelligenetics, Mountain View, 

CA) were used to determine the order of the overlapping DNA sequences. DNA 
Strider™ version 1.1 was used to identify the open reading frames for sefA, sefB and 
sefC, which were predicted to encode polypeptides of 14,436 Mr, 28,012 Mr and 
90,268 Mr, respectively. The predicted amino acid sequences of the SefA, SefB and 
10 SefC proteins were compared to proteins listed in the GenBank (release #66.0), SWISS- 
PROT (release #16.0), and GENPEPT (release #64.3) data bases and the MACAW 
program (NCBI, Bethesda, MD) to align regions of local similarity among proteins 

exhibiting similarity. 

The adjacent downstream gene, sefB y encodes a fimbrial periplasmic 

15 chaperone protein. 

sefC, the gene immediately downstream of sefB, encodes a fimbrial outer 
membrane protein that contains nine putative membrane-spanning domains. 

The nucleotide sequence of DNA immediately downstream of sefABC 
revealed a fourth open reading frame (ORF) designated sefD, This gene had the same 
20 translational polarity as sefABC (Figure 1). In feet, the AUG start codon for sefD 
overlapped the UGA stop codon of sefC The gene organization of the gene cluster, has 
been confirmed on the chromosome by Southern blot analysis of Kpnl digested 
S. enteriditis 3b chromosomal DNA hybridized with sefA and seJD specific probes. 
Preceding the sefD ORF by 8 bp was a consensus Shine-Dalgamo sequence for 
25 translation initiation (GGAG). The sefD ORF was 447 bp and the predicted molecular 
weight of the encoded protein, designated SefD, was 16,722 Daltons. 

The predicted amino acid sequence of SefD had a putative signal 
peptidase cleavage site between Ser-24 and Ser-25 as determined by the method of von 
Heijne (1984). The presence of a putative leader sequence suggested that the protein 
30 was exported from the cytoplasm to either the periplasmic space or the outer membrane. 

To confirm that sefABC encoded proteins of the predicted sizes, proteins 
were translated in vitro from pKXl. The plasmid-encoded proteins were labeled with 
35S-methionine using a cell-free coupled transcription-translation system (Prokaiyotic 
DNA-Directed Transcription-Translation System Kit, Amersham, Oakville, ON) 
35 according to the manufacturers 1 instructions. 

Plasrnids carrying either the 5.3 kb fragment of the sef peron or 
deletions thereof (delB 15, delB23, delDIO) were used as DNA templat s (Figure 8B). 
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Plasmids purified by alkaline lysis (Sambrook et al., supra) were incubated with the 
other reaction components in a final reaction volume of 30 |ji and incubated at 37°C for 
30 min. Unlabeled methionine (5 \il) was added, the mixture was incubated a further 5 
min and then the reaction was terminated by placing the reactions at 0°C. Ten \)1 of the 
5 reaction mixture was added to 2x Laemmli sample buffer (Laemmli, "Cleavage of 
structural proteins during the assembly of the head of bacteriophage T4," Nature 
227:680-685, 1970) and then subjected to microdialysis (Marusyk and Sergent, "A 
simple method of dialysis of small volume samples, 11 Anal, Biochem. 705:403-404, 
1989) on Millipore filters (VFWP25, Millipore, Millipore Corp., Bedford, MA) for 10 to 

10 15 min prior to SDS-PAGE analysis on 12% gels poured with a 5% stacking gel 
(Laemmli, supra). The acrylamide gel was fixed for 30 min in 7% acetic acid for 30 
min, dried onto 3MM paper for 1 hour at 80°C and then exposed to X-Omat-AR5 film 
(Kodak, Rochester, NY) to visualize the labeled proteins. 

Several translation products were identified (Figure 8A, lane 2). The 

15 14KA/r protein was identified on Western blots as SefA (Figure 8 A, lane 7). The 
90KM r protein was identified as SefC. The 27KA/r protein was identified as SefB. 
The 70K, 44K and 40KA/ r bands were likely minor degradation products of SefC 
because these bands were absent when pSCl, which contained a deletion in sefO, was 
used as the template (Figure 8A, lanes 2 and 3). The 16KM r band seemed to be a 

20 minor degradation product of SefB as this band remained when pSCl was used as a 
template (Figure 8 A, lanes 2, 3). When the three DNasel deletion subclones, delBIS, 
delB23 and delDIO, were each used as templates, the bands for SefB, SefC and their 
minor degradation products were absent (Figure 8A, lanes 4-6) indicating sefA and/or its 
upstream region is necessary for the expression of sefB and sefC, as was predicted from 

25 the DNA sequence analysis (Figures 1, 2). 

To confirm that translation of SefB and SefC was dependent on the 
presence of sefA and/or the region upstreiam of sefA, the transcription start sites for 
sefA, sefB and sefC were determined. Primer extension studies consistently revealed 
transcription start sites immediately upstream of sefA. These included two major 

30 extension products as well as several minor ones. When the primer extension reaction 
was performed at 50°C, a temperature expected to destabilize secondary structures, 
reverse transcriptase still stopped at all the sites with the same frequency suggesting that 
stem-loop structures were not blocking the migration of reverse transcriptase. No 
transcription start sites could be found immediately upstream of sefB or seJC (data not 

35 shown). These results indicated that the 5* end of the mRNA transcript of sefABC was 
initiated upstream of sefA. 



WO 94/25598 



37 



PCT/IB94/00207 



EXAMPLE 7 
In vivo expression of assembled fimbriae in E. coli 

Immunogold labeling of 5. enteritidis cells with polyclonal immune serum 
5 raised to purified SEF14 fimbriae revealed thin, filamentous organelles located on the 
cell surface (Figure 10A). E coli cells hosting pKXl or cos 48 were shown by Western 
blot analysis to produce SefA. 

To determine whether E coli cells hosting pKXl or cos48 were 
assembling intact fimbriae, E. coli cells were grown in 2.5 ml of colonization factor 
10 antigen static broth culture (Evans et al., "Hemagglutination of human group A 
erythrocytes by enterotoxigenic Escherichia coli isolated from adults with diarrhea: 
correlation with colonization factor," Infect Immun. 75:330-337, 1977) for 60 h at 
37°C Cells were then mounted on formvar-coated grids, incubated with immune serum 
to native SEF14 fimbriae, then labeled with protein A-gold (Auroprobe, Pharmacia, 
15 Uppsala, Sweden). Labeled cells were then negatively stained with ammonium 
molybdate. 

Immunoelectron microscopic examination of £ coli HB101 containing 
cos48 revealed the presence of immunogold-labeled SEF14 fimbriae on the cells surface 
(Figure 10B). Similar experiments demonstrated SEF14 fimbriae were produced in E 

20 coli containing a cloned 10 kb Kpnl fragment encoding se/U 2 U 2 ABCD. Examination of 
E coli JM109 carrying pKXl revealed the formation of surface blebs that were 
specifically gold labeled (Figure 10C). E. coli carrying 44 kb of S. enteritidis DNA 
(cos48) encompassing the sefABC operon displayed intact fimbriae, conversely, E coli 
carrying the 5.3 kb Hindm fragment subcloned from cos48 showed that distinguishable 

25 SEF14 filamentous fimbriae were not assembled, thus indicating that additional fimbrial 
genes are required for in vivo expression of assembled fimbriae. 

EXAMPLE 8 
Co-Expression Of The SefA. SefB and SefC Genes 

30 

seJB encodes a chaperone protein capable of preventing premature 
polymerization of fimbrial subunits in the periplasm. seJB can be used to chaperone the 
SefA protein, including the SefA protein fused to a foreign gene product (preferably 
comprising a foreign antigen) through the inner membrane to the periplasm, where the 
35 protein is ushered out of the cell by seJC. Such a foreign gene product can also be 
incorporated in a SEF14-like fimbrial structure. 
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A 10 ld> Kpnl fragment from cos48 is cloned into pUC19. The resulting 
clone is transformed into a competent K coli or Salmonella host cell. To demonstrate 
expression, the host cells are disrupted and subjected to Western blot analysis using 
antisera to SefA, SefB and SefC proteins. The supernatant (prior to cellular disruption) 
5 is also subjected to Western blot analysis using antisera to SefA protein. 

To effect transport of protein to the supernatant without requiring the 
expression of sefC, a 1.5 fcb Bsml fragment isolated from pTZ19 pim, is blunted using 
T4 DNA polymerase and cloned into the Smal ate of pUC19. The EcdRI/HindUl 
fragment of this clone is ligated into the EcdBUHinSBl sites of pINmi 1 3 -B1 to create a 
10 construct where transcription of sefAseJB is initiated from the strong tpp promoter, 
which promoter is undo* control of the lac operator. 

To demonstrate expression of the genes in £ coli, the resulting plasmid 
is transformed into a "leaky" K coli strain known in the art. Expression is induced from 
the lac operator by the addition of IPTG. Western blot analysis of disrupted cells using 
15 antisera to SefA and antisera to SefB shows that both proteins are produced in the E. 
coli. 

EXAMPLE 9 

Production of Protective Antibodies to Salmonella enteritidis 

20 

In a preliminary experiment, outbred female mice were immunized with 
5 x 10 5 5. enteritidis-2b by interperitoneal injection (IP) on day 0. Blood samples were 
collected at 10 and 20 days. Western blot analysis indicated that serum antibodies were 
generated to SefD (SEF18) but no detectable antibodies were found to the other 
25 fimbrins. 

Whole cells of wild type £ enteritidis 3b, an LPS o-polysaccharide 
deficient strain (3b TipAa4-9) and a SEF17-deficient strain (3b TnphoA 2-7f) were 
grown under various conditions (T medium, 37°C, 24h; CFA broth, static, 37°C, 48h; 
LB broth, aerated, 37°C, 24h) to variably express the four fimbrial types SEF14, SEF17, 

30 SEF18, and SEF21. Cells were harvested by centrifiigation, washed in PBS, 
resuspended to an OD650nm of 2.0, treated with a final concentration of 3.5% 
formaldehyde and, incubated overnight at 4°C, the cells were then washed 5 times in 
PBS, then resuspended to an OD650nm of 1.0. These whole cell bacterins were injected 
into three week old turkeys using Alhydrogel as an adjuvant. The turkeys were boosted 

35 7 and 14 days later and then bled. The serum was tested for antibodies against all four 
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fimbrial types on Western blots and it was found that the turkeys made antibodies to 
SEF14, SEF17 and SEF18. 

Purified SEF14, SEF17 and SEF21 were used to raise monoclonal 
antibodies using standard monoclonal antibody generation techniques (Harlow and Lane, 
5 supra). Several monoclonal antibodies were generated to SEF14 (total of 8), SEF17 
(total of 2) and SEF 21 (total of 5). One of the monoclonal antibodies generated to 
SEF14 and two generated to SEF21 were found to be of isotype IgA This indicates 
that some fimbriae may specifically illicit a secretory antibody response, which can be 
advantageous for a vaccine directed toward Salmonella, an enteric pathogen. 

10 

EXAMPLE 10 
Production of polyclonal antibodies to Fimbriae 

15 a standard immunization protocol has been used to raise rabbit 

polyclonal antibodies to each of the four fimbrial types, namely; SEF14, SEF17, SEF18 
and SEF21. Purified, insoluble fimbriae were obtained as previously described 
(Collinson et al., supra), resuspended in phosphate buffered saline and emulsified with 
Freund's complete adjuvant prior to subcutaneous and intramuscular injection of a 1- 

20 month-old female New Zealand White rabbit. The rabbit was subsequently boosted at 1 
or 2 week intervals with 50 to 500 ug protein emulsified in Freund's incomplete 
adjuvant. When the antibody titers were sufficiently high, the serum was collected. 
These serum antibodies were shown numerous times, via Western blotting techniques, to 
bind to their respective purified fimbrin subunits, and to bind the native fimbriae on 

25 whole cells as ascertained by immunoelecton microscopic observation of protein A gold- 
labelled fimbriae. 

EXAMPLE 11 
Creation of a sefA f SET 14) Mutant 

30 

pDRA, which contains a Dral fragment encompassing the sefA gene of 
S. ententidis inserted in the Smal site of pT7T318, is digested with £coO109I. Two 
oligonucleotides, LKC3A and LKC3B (Table 1, infra) are synthesized, annealed 
together, phosphorylated and ligated to £coO109I-digested pDRA to inactivate sefA. 
35 The pDRA:LKC3 construct is digested with BglR to remove multiple linkers and 
Sephaglas™ purified following agarose gel electrophoresis. Self-Iigation of this 
construct results in the generation of pKCS2 which contains sefA inactivated with a 
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single linker. pKCS2 is digested with EcoBI and Hindi to remove the sefA::LKC3 
fragment which is then purified prior to electroporation. 

To insertionally inactivate sefA with a kanamycin (Km) resistance gene, 
linearized pDra is blunt-ended using the Klenow fragment and ligated to the 1.55 kb 
5 Fspl fragment of pKEM containing the kanamycin resistance gene. The resulting 
plasmid, pKCSl, was digested with£ct>RI and HincU to cut out the sefAvXxa fragment 
which is purified prior to electroporation. 

S enteritidis 3b is transformed with the insertionally inactivated sefA gene 
fragment by electroporation to permit chromosomal sefA gene replacement by 
10 homologous recombination. Transformants are selected on media containing kanamycin 
for those transformed with fimA::Km or screened for the inability to produce SefA or 
SEF14 for those transformed with the fimA : :U<LC3. The constructs are confirmed by 
PCR and DNA sequencing or Western blot analysis. 

This method can also be applied using cloned and sequenced 
15 chromosomal sefA encoding DNA fragments from S. enteritidis (or SL berta, S. dublin, 
S. gattinarum, or 'SL pullorum). 

EXAMPLE 12 
Creation of an azfA (SEF1 7) Mutant 

20 

The plasmid pW3, which carries a 394 bp agfA fragment, containing a 
unique Xcml site at bp 156, is digested with Xcm\ and the linearized plasmid is purified 
by agarose gel electrophoresis and Sephaglas™ treatment. Two oligonucleotides, 
LKC1A and LKCIB (see Table 1), synthesized on a Model 391 PCR-Mate™ DNA 

25 synthesizer (Applied Biosystems, Mssissauga, ON), are mixed and annealed to create a 
double-stranded DNA fragment, LKC1, with 'sticky-ends' complementary to those 
generated by Xcml digestion. LKCl, which contains a Bgttl cleavage site, is 
phosphorylated using T4 polynucleotide kinase (Pharmacia P-L Biochemicals, Inc., 
Piscataway, NJ) (Sambrook et al. 1989) and ligated to Jfcwl-digested pW3 to 

30 insertionally inactivate agfA. Bgttl is used to digest pW3 carrying multiply ligated 
linkers. Following purification, pW3 carrying the Bgttl digested linkers is self-ligated to 
generate pKCAl. Alternatively, pW3 carrying the Bgttl digested linkers was mixed 
with the 3.4 kb HmdJH fragment of pHP45f2Cm (carrying a chloramphenicol resistance 
gene) to generate pKCA2. 

35 The resulting insertionally inactivated agfA gene fragment is cut from 

pKCAl or pKCA2 using EcdRl and purified. S enteritidis 3b is transformed with the 
insertionally inactivated agfA fragment by electroporation to allow chromosomal gene 
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replacement by homologous recombination. The transformants are selected on media 
containing chloramphenicol for those transformed with agfAv.QCm or on T-medium 
containing Congo red for the transformants containing agfA::LKCl. The genotype and 
phenotype of the agfA' transformants are confirmed by PCR and DNA sequencing or 
5 Western blot analysis, respectively. 

This method can also be applied using cloned and sequenced 
chromosomal DNA fragments from S. enteritidis which encode larger agfA flanking 
sequences to promote higher frequency of gene replacement by homologous 
recombination. 

10 

EXAMPLE 13 
Creation of a sefD (SEF18) mutant 

15 pSCC carries an S. enteritidis sefD gene fragment flanked EcoRI and 

Pstl sites. There are no unique internal restriction sites in sefD, so PCR is used to 
generate an internal BamHI she by site-specific mutagenesis. The primers KC6 and 
KC48 are used to amplify by PCR the 263 bp fragment. The product is digested with 
EcoRI and BamHI, purified and cloned into EcdSJIBamHL digested pUC18 to generate 

20 pKCDl. A second set of primers, KC5 and KC47, are used to amplify the 333 bp 
fragment. The product is digested with BamHL and Ps/I^purified, and cloned into 
BamWJPstl digested pKCDl to generate pKCD2. The 2.0 kb BamHI flSm/Spc 
fragment from pHP45Q (which contains streptomycin and spectinomycin resistance 
genes) is cloned into BamHI digested pKCD2 to generate pKCD3. The EcdKUBamHl 

25 fragment of pKCD3 is purified prior to electroporation. The sefD gene containing the 
engineered BamHI site is also insertionally inactivated using LKC2 linkers converted to 
LKC4 by digestion with Bgia. 

S enteritidis 3b is transformed with the resulting insertionally inactivated 
sefD fragment, sefD::QSmSpc 9 or sefD:LKC4 by electroporation to permit 

30 chromosomal gene replacement. The transformants are selected on media containing 
spectinomycin and streptomycin for those transformed with sefD::€lSpcSm or screened 
for the inability to produce SefD for those transformed with the sefD:±KC4. The 
genotype and phenotype of the sefD~ transformants are confirmed by PCR and DNA 
sequencing or Western blot analysis, respectively. 

35 This method can also be applied using cloned and sequenced 

chromosomal DNA fragments from S. enteritidis which encode larger flanking 



WO 94/25598 



42 



PCT/IB94/00207 



sequences around seJD and therefore increase the frequency of homologous gene 
replacement. 

5 EXAMPLE 14 

Creation of 2. FimA (SEF21) mutant 

A 171 bp HiriPI fragment of fimA previously subcloned into the unique 
Accl rite of M13mpl8 is cut from the replicative form by double digestion with BamHL 

10 and HiruSUL. This fimA fragment is subcloned into BamHL and HinSHL digested pUC18 
to generate pKCFl. Two complementary oligonucleotides, LKC2A and LKC2B (Table 
1), are synthesized, annealed together and then phosphoiylated with T4 polynucleotide 
kinase to generate the double-stranded linker LKC2 which contains an internal BgILL site 
and possessed sticky ends compatible with those generated by BstBl digestion. pKCFl 

15 is digested with BstBl, purified by agarose gel electrophoresis and Sephaglas™ 
treatment, ligated to LKC2, digested with BgILL and re-purified. pKCFl carrying the 
LKC2 linker is self-ligated to generate pKCF2. The fimk:±XC2 fragment is cut from 
pKCF2 with BamHL and JTwdlll and purified. To generate a fimX fragment 
insertionally inactivated with QTc, BstBl, linearized fragments of pKCFl and 1.9 kb 

20 Hindm fragments of pHP45i!Tc (carrying the tetracycline resistance gene (OTc)) are 
blunt-ended by treatment with the Klenow fragment of DNA poll and ligated to create 
pKCF3. Prior to introduction into S. enteritidis, the fimAv.QTc fragment is cut from 
pKCF3 with EcdRl and HindTLL and purified. 

S enteritidis 3b is transformed with the resulting insertionally inactivated 

25 fimA fragment by electroporation to allow chromosomal fimA gene replacement by 
homologous recombination. The fimA" transformants are selected on media containing 
tetracycline to select cells transformed with fimA::QTc or screened for the inability to 
produce FimA or SEF21 for those transformed with the fimA: IXC2. The genotype and 
phenotype of the fimA' recombinants are confirmed by PCR and DNA sequencing or 

30 Western blot analysis, respectively. 

This method can also be applied using cloned and sequenced 
chromosomal DNA fragments from 5. enteritidis which encode larger flanking 
sequences around fimA and therefore increase the frequency of homologous gene 
replacement. 
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Table 1 

Oligonucleotide sequences for cloning and mutagenesis procedures 
of S. enteritidis fimbria! genes. 



5 


Linker and 
primer names 


Sequence 










LKCl a 












LKC1A 


5 GCTAACAGAGTAAGATCTTGCTAACGAGCGG 


(SEQ. 


ID 


NO. ) 




LKC1B 


CCGATTGTCTCATTCTAGAACGATTGCTCGC 5 * 


(SEQ. 


ID 


NO. ) 


10 














LKC2* 












LKC2A 


5, CGCTAAGCAGATCTAAACCCTAATCCC 


(SEQ. 


ID 


NO. ) 




LKC2B 


GATTCGTCTAGATTTGGGATTAGGGGC 5 * 


(SEQ. 


ID 


NO. ) 


15 


LKC3 fl 












LKC3A 


^'ggcctaatgacctaagatcttgctaacgaat 


(SEQ. 


ID 


NO. ) 




LKC3B 


GATTACTGGATTCTAGAACGATTGCTTACCG 5 ' 


(SEQ. 


ID 


NO. ) 




KC5 


5, gagagggaaaaaggatcctcattagttcaag 


(SEQ. 


ID 


NO. ) 


20 


KC6 


CTCTCCCTTTTTCCTAGGAGTAATCAAGTTC 5 


(SEQ. 


ID 


NO. ) 




KC48 


5, agcggataacaatttcacacaggaaac 


(SEQ. 


ID 


NO. ) 




KC47 


5, cgccagggttttcccagtcacgac 


(SEQ. 


ID 


NO. ) 




LKC4* 










25 




5 'gatctaaaccctaatccccgctaagca 


(SEQ. 


ID 


NO. ) 



ATTTGGGATTAGGGGCGATTCGTCTAG 5 ' (SEQ. ID NO. ) 

a These linkers were designed to have TAA or TAG stop oodons in all three frames so that insertion of 
this piece of DNA into a gene will cause premature termination of the protein as well as introducing a 
3 0 useful cloning site. 

EXAMPLE 15 
In vivo expression of assembled fimbriae in E. coli 

Immunogold labeling of S. enteritidis cells with polyclonal immune serum 
35 raised to purified SEF14 fimbriae revealed thin, filamentous organelles located on the 
cell surface (Figure 6A). £. coli cells hosting pKXl or cos48 were shown by Western 
blot analysis to produce SefA. 
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To determine whether £ coli cells hosting pKXl or cos48 were 
assembling intact fimbriae, £ coli cells were grown in 2.5 ml of colonization factor 
antigen static broth culture (Evans et al., "Hemagglutination of human group A 
erythrocytes by enterotoxigenic Escherichia coli isolated from adults with diarrhea: 
5 correlation with colonization fector," Infect Immun. 75:330-337, 1977) for 60 h at 
37°C. Cells were then mounted on formvar-coated grids, incubated with immune serum 
to native SEF 14 fimbriae, then labeled with protein A-gold (Auroprobe, Pharmacia, 
Uppsala, Sweden). Labeled cells were then negatively stained with ammonium 
molybdate. 

10 Immunoelectron microscopic examination of £ coli HB101 containing 

cos48 revealed the presence of immunogold-Iabeled SEF 14 fimbriae on the cells surface 
(Figure 6B). However, examination of £ coli JM109 carrying pKXl revealed the 
formation of surface blebs that were specifically gold labeled (Figure 6C). £ coli 
carrying a 44 kb fragment of 5. enteritidis DNA (cos48) encompassing the 

15 se/U2UjABCD operon, as well as E coli carrying a 10 kb Kpnl fragment subcloned 
from cos48, displayed intact fimbriae. Conversely, £ coli carrying a 5.3 ki> Hm<SR 
fragment subcloned from cos48 showed that distinguishable SEF14 filamentous fimbriae 
were not assembled. 

20 From the foregoing, it will be appreciated that, although specific 

embodiments of the invention have been described herein for purposes of illustration, 
various modifications may be made without deviating from the spirit and scope of the 
invention. Accordingly, the invention is not limited except as by the appended claims. 
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Claims 

1. A composition able to elicit an immune response in an animal 
comprising an isolated protein selected from the group consisting of a SefC protein, a SefD 
protein, a TctC protein and an AgfX protein, in combination with a physiologically 
acceptable carrier or diluent. 

2. A vector construct comprising a mutant gene selected from the 
group consisting of a mutant tctA gene, a mutant tctB gene and a mutant tctC gene, said 
mutant gene capable of inactivating the corresponding tricarboxylic add transport pathway 
in Salmonella upon incorporation of sad vector construct into said Salmonella, to yield an 
attenuated Salmonella. 

3 . A biologically pure attenuated Salmonella comprising a mutation in a 
gene selected from the group consisting a tctA gene, a tctB gene, and a tctC gene, to yield 
said attenuated Salmonella, wherein said mutation eliminates the function of the 
corresponding tricarboxylic acid transport system of said attenuated Salmonella. 

4. A composition able to elicit an immune response in an animal 
comprising an attenuated Salmonella having an inactivating mutation in a gene or operon 
selected from the group consisting of a tctA gene, a tctB gene, a tctC gene, a tctl operon, a 
tctll operon and a tctlH operon, in combination with a physiologically acceptable carrier or 
diluent. 

5. A composition able to elicit an immune response in an animal 
comprising an attenuated Salmonella having a mutation in two or more fimbriae encoding 
genes selected from the group consisting of se/A, sefD, agfA, and JimA, wherein said 
mutations effectively prevent production of fimbriae from said genes, in combination with a 
physiologically acceptable carrier or diluent 

6. The composition of any one of claim 4 or 5 able to elicit an immune 
response in an animal wherein said attenuated Salmonella is capable of expressing a foreign 
antigen in one r more of its fimbriae or in an aggregate. 
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7. The Salmonella of claim 6 wherein said attenuated Salmonella is 
capable of expressing a foreign antigen fused to a SefA, SefC, SefD, TctC, FimA, or AgfA 
protein. 

8. An expression vector construct comprising a sefA gene, a sefO gene, 
sefD gene, a FimA gene, a tctC gene or an agfA gene operably linked in open reading frame 
to a foreign gene to yield a dicistronic gene product, said dicistronic gene product capable 
of being expressed in a fimbriae, or an aggregate comprising said gene product, of a 
Salmonella. 

9. An expression vector construct comprising a sefA gene, a seJC gene, 
sefD gene, zfimA gene, a tctC gene or an agfA gene operably linked in open reading frame 
to a foreign gene to yield a dicistronic gene product, said dicistronic gene product capable 
of being expressed in a fimbriae, or an aggregate comprising said gene product, of an & 
coll 

10. An expression vector construct comprising an agfA gene capable of 
producing in£ coli a stable fimbriae comprising AgfA protein. 

11. A stable fimbriae comprising an AgfA. protein fused to one or more 
foreign antigens. 

12. A method of eliciting an immune response in an animal, comprising: 

(a) separating fimbriae comprising a SefA. protein, a SefD protein, a 
FimA protein, or an AgfA protein fused to a foreign antigen grown on a Salmonella host 
cell from said Salmonella host cell; and 

(b) introducing said fimbriae into said animal in conjunction with a 
physiologically acceptable carrier or diluent 

13. A method of eliciting an immune response in an animal to 
Salmonella, comprising: 

(a) separating an amino acid polymer comprising a SefA protein, a SefC 
protein, a SefD protein, a FimA protein, a TctC protein, or an AgfA protein fused to a 
foreign antigen grown on a Salmonella host cell from said Salmonella host cell; and 

(b) introducing said amino acid polymer into said animal in conjunction 
with a physiologically acceptable carrier or diluent 
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14. A method of eliciting an immune response in an animal to 

Salmonella, comprising: 

(a) separating fimbriae comprising a SefX protein, a SefD protein or an 
AgfA protein grown on an £ coli host cell from said £ colt host cell; and 

(b) introducing said fimbriae into said animal in conjunction with a 
physiologically acceptable carrier or effluent 

15. A method of eliciting an immune response in an animal to 

Salmonella, comprising: 

(a) separating an amiilo acid polymer comprising a SefA protein, a SefC 
protein, a SefD protein, a TctC protein, or an AgfA protein grown on an £ coli host cell 
from said £ coli host cell; and 

(b) introducing said amino acid polymer into said animal in conjunction 
with a physiologically acceptable canier or diluent. 

16. The method of claim 14 or 15 wherein said protein is fused to a 
foreign antigen. 

17. A method of eliciting an immune response in an animal to Salmonella 
comprising introducing into said animal one of the group consisting of an isolated SefD 
protein, an isolated SefC protein and an isolated AgfA protein, in combination with a 
physiologically acceptable carrier or effluent 

18. A method of eliciting an immune response in an animal to 
Salmonella, comprising introducing an attenuated Salmonella into said animal, said 
Salmonella comprising an ineffective gene or operon selected from the group consisting of 
a tctA gene, a tctB gene, a tctC gene, a ted operon, a tetIZ operon, and a tctUI operon. 

19. A method of eliciting an immune response in an animal comprising 
introducing into an animal an attenuated Salmonella having a mutation in two or more 
fimbriae encoding genes selected from the group consisting of a sefA, seJD, agfA, wdfimA y 
wherein said mutations effectively prevent production of fimbriae from said genes, in 
combination with a physiologically acceptable carrier or diluent 

20. The method of claim 18 or 19 wherein said at tenu a t ed Salmonella 
expresses one or more foreign antigens, said foreign antigens eliciting an immune response 
from said animal. 
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21. The method of claim 20 wherein said one or more foreign antigens 
are located on a fimbriae of said attenuated Salmonella. 

22. A method of eliciting an immune response in an animal to Salmonella 
comprising introducing an £1 coli into said animal, said EL coli expressing Salmonella 
fimbriae. 

23. The method of claim 22 wherein said Salmonella fimbriae comprise 
agfA gene product 

24. The method of claim 22 wherein said Salmonella fimbriae further 
comprises a foreign antigen, said foreign antigen capable of eliciting an immune response 
from said animal. 

25. A method of eliciting an immune response in an animal to Salmonella 
comprising introducing a vector into said animal wherein said vector comprises a gene 
selected from the group consisting of a se/C gene, sefD gene, tctC, andanag^l gene. 
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FIGURE 2A 

20 40 60 80 

GGGGATGnGTGTAAAGATAAAAAMTAGTGATCCnGTTTIT^ 

100 120 sefA 140 -10 * 

RrArAKATAAATTCTfyfiA ATfirTMTAGTTGAT rnrrGGAGATTnGTAAT ATG CGT AAA TCA GCA TCT GCA GTA 
* * ** IHF SD SefA Met arg lys ser ala ser ala val 

160 180 200 220 

GCA GTT CTT GCT TTA ATT GCA TGT GGC AGT GCC CAC GCA GCT GGC TTT GTT GGT AAC AAA GCA 
ala val leu ala leu ile ala cys gly ser ala his ala ala gly phe val gly asn lys ala 

240 260 280 

GTG GTT CAG GCA GCG GTT ACT ATT GCA GCT CAG AAT ACA ACA TCA GCC AAC TGG AGT CAG GAT 
val val gin ala ala val thr ile ala ala gin asn thr thr ser ala asn trp ser gin asp 

300 320 340 

CCT GGC TTT ACA GGG CCT GCT GTT GCT GCT GGT CAG AAA GTT GGT ACT CTC AGC An ACT GCT 
pro gly phe thr gly pro ala val ala ala gly gin lys val gly thr leu ser ile thr ala 

360 380 400 

ACT GGT CCA CAT AAC TCA GTA TCT An GCA GGT AAA GGG GCT TCG GTA TCT GGT GGT GTA GCC 
thr gly pro his asn ser val ser ile ala gly lys gly ala ser val ser gly gly val ala 

420 440 460 

ACT GTC CCG TTC GTT GAT GGA CAA GGA CAG CCT GTT TTC CGT GGG CGT ATT CAG GGA GCC AAT 
thr val pro phe val asp gly gin gly gin pro val phe arg gly arg ile gin gly ala asn 

480 500 520 

ATT AAT GAC CAA GCA AAT ACT GGA ATT GAC GGG CTT GCA GGT TGG CGA GTT GCC AGC TCT CAA 
ile asn asp gin ala asn thr gly ile asp gly leu ala gly trp arg val ala ser ser gin 

540 560 580 600 

GAA ACG CTA AAT GTC CCT GTC ACA ACC TTT GGT AAA TCG ACC CTG CCA GCA GGT ACT TTC ACT 
glu thr leu asn val pro val thr thr phe gly lys ser thr leu pro ala gly thr phe thr 

620 

GCG ACC TTC TAC GTT CAG CAG TAT CAA AAC TAA 
ala thr phe tyr val gin gin tyr gin asn XXX 

640 660 680 700 

TTTMTTTAAACTTTATAMTGCCCTCM^ 

720 740 sefB 760 780 

GGTTTTATGCTTCCATGCAAAMCTTAAAGisQGGATT ATG TAT ATT TTG AAT AAA TTT ATA CGT AGA ACT 

SD SefB Met tyr ile leu asn lys phe ile arg arg thr 
800 820 840 

GTT ATC TTT TTC TTT TTT TGC TAC CTT CCA ATT GCT TCT TCG GAA AGT AAA AAA ATT GAG CAA 
val ile phe phe phe phe cys tyr leu pro ile ala ser ser glu ser lys lys ile glu gin 

§60 880 900 

CCA TTA TTA ACA CAA AAA TAT TAT GGC CTA AGA TTG GGC ACT ACA CGT GTT ATT TAT AAA GAA 
pro leu leu thr gin lys tyr tyr gly leu arg leu gly thr thr arg val ile tyr lys glu 

920 940 960 

GAT GCT CCA TCA ACA AGT TTT TGG An ATG AAT GAA AAA GAA TAT CCA ATC CTT GTT CAA ACT 
asp ala pro ser thr ser phe trp ile met asn glu lys glu tyr pro ile leu val gin thr 

980 1000 1020 

CAA GTA TAT AAT GAT GAT AAA TCA TCA AAA GCT CCA TTT ATT GTA ACA CCA CCT ATT TTG AAA 
gin val tyr asn asp asp lys ser ser lys ala pro phe ile val thr pro pro ile leu lys 
1040 1060 1080 1100 

GTT GAA AGT AAT GCG CGA ACA AGA TTG AAG GTA ATA CCA ACA AGT AAT CTA TTC AAT AAA AAT 
val glu ser asn ala arg thr arg leu lys val ile pro thr ser asn leu phe asn lys asn 

1120 1140 1160 

GAG GAG TCT TTG TAT TGG TTG TGT GTA AAA GGA GTC CCA CCA CTA AAT GAT AAT GAA AGC AAT 
qlu qlu ser leu tyr trp leu cys val lys gly val pro pro leu asn asp asn glu ser asn 

1180 1200 1220 

AAT AAA AAC AAC ATA ACT ACG AAT CTT AAT GTG AAT GTG GTT ACG AAT AGT TGT ATT AAA TTA 
asn lys asn asn ile thr thr asn leu asn val asn val val thr asn ser cys ile lys leu 

1240 1260 1280 

ATT TAT AGG CCT AAA ACT ATA GAC TTA ACG ACA ATG GAG ATT GCA GAT AAA TTA AAG TTA GAG 
ile tvr arg pro lys thr ile asp leu thr thr met glu ile ala asp lys leu lys leu glu 

1300 1320 1340 

AGA AAA GGA AAT AGT ATA GTT ATA AAG AAT CCA ACA TCA TCA TAT GTG AAT ATT GCA AAT ATT 
arg lys gly asn ser ile val ile lys asn pro thr ser ser tyr val asn ile ala asn ile 

1360 1380 1400 

AAA TCT GGT AAT TTA AGT TTT AAT ATT CCA AAT GGA TAT ATT GAG CCA TTT GGA TAT GCT CAA 
lys ser gly asn leu ser phe asn ile pro asn gly tyr ile glu pro phe gly tyr ala gin 
J 1420 1440 1460 1480 

TTA CCT GGT GGA GTA CAT AGT AAA ATA ACT TTG ACT ATT TTG GAT GAT AAC GGC GCT GAA ATT 
leu pro gly gly val his ser lys ile thr leu thr ile leu asp asp asn gly ala glu ile 

ATA AGA GAT TAT TAG 
ile arg asp tyr XXX 
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FIGURE 2B 

icnn co^r 1520 1540 1560 

TTTAAGGTGTAAACAMTG AAG AAA ACC ACA ATT ACT CTA TTT GTT TTA ACC AGT GTA TTT CAC TCT 
mM i^Serc Met 1>s lys thr thr lie thr leu pteval leu thr ser val phe hiyer 

GGA AAT GTT TTC TCC AgPcAA TAT AAT TTC GAC TAT GGA AGT TTG AGT CTT CCT CCC GGT GAG 
^ asn Sal phe ser art gin tyr asn phe asp tjrjly ser leu ser leu pro progly glu 

AAT GCA TCT TTT CT^AGT GTT GAA ACG CTT CCT GGT AAT TAT GTT GTT GAT GTA TAT TTG AAT 
Ssn ala ser phe lei) ser val glu thr leu progly asn tyr val val asp vaMyr leu asn 

ml rAR tta AAA 7 GAA ACT ACT GAG TTG TAT TTC AAA TCA ATG ACT CAG ACT CTA GAA CCA TGC 
?sl> glS ™ iyt gli thr ?hr glu leu ty^ghe lys ser met thr gin thr leu glu pro cys 

TTA ACA AAA 7 GAA AAA CTT ATA AAG TAT GGG ATC GCC ATC CAG GAG CTT CAT GGG TTG CAG TTT 
™ th? lys glu lyt leu ile lys tyr gly lie ala lie gin glu Jeuhis gly leu gin phe 

GAT AATGAA CAA TGC GTT CTC TTA GAG CAT TCT CCT CTT AAA TAT ACT TAT AAC GCG GCT AAC 
aso Zsn gl5 glS §1 val leu leu glu his ser pro leu lys tyr^thr tyr asn ala ala asn^ 

rAAAGT TTG Cn TTA AAT GCA CCA TCT AAA ATT CTA TCT CCA ATA GAC AGT GAA ATT GCT GAT 
Slfi sfr 12 leu ™ un al2 pro ser lys fie leu ser nrcMle asp ser glu ile ala^p 

GAA AAT ATC TGG GAT GAT GGC^ATT AAC GCT TTT Cn TTA AAT TAC AGA GCT AAT TAT TTG CAT 
SlS «n ile 2? iTp Tsp % ile asn ala phe leu leu^sn tyr arg ala asn tyMeu h 1S 

TCT AAG GTT GGA GGA GaTgAT TCA TAC TTT GGT CAA ATT CAA CCT GGT TTT AAT TTT GGT CCC 
IS ljs Si fS gly glu asp ser tyr phe gly jjljHle gin pro gly phe asn^he gly pro 

rnr car tta ARR AAT^fTA TCA TCT TGG CAA AAC TTG TCA AGC GAA AAA AAA HT GAA TCA GCA 
Sp Sg S 5m afn led lef ler tfp gS asn leu ser ser glu lys lyyhe glu ser ala 

TAT ATT TAT GCT GAG CGA GGT TTA AAA AAA ATA AAG AGC AAA CTA ACA GTT GGG GAC AAA TAT 
tyr ile tyr Si glu arg gly leu lys Jg ile lys ser lys leu thr^al gly asp lys tyr 

ACC AGT GCA°GAT TTA TTC GAT AGC GTA CCA TTT AGA GGC TTT TCT TTA AAT AAA GAT GAA AGT 
thr ser ala asp leu phe asp ser val oro phe arg gl.y phe seMeu asn lys asp glu ser 

ATG ATA b CCT TTC TCA CAG AGA ACA TAT TAT CCA ACA ATA CGT GGT ATT GCG AAA ACC AAT GCG 
Set ile pro phe ser gin arg thr tyr tyr pro thr He arg^ly ile ala lys thr asn ala^ 

ACT 2 GTA GAA GTA AGA CAA AAT GGA TAC TTG ATA TAT TCT ACT TCA GTC CCC CCC GGG CAA TTC 
£ Sal gl£ Sal Tg gin asn gly tyr leu ile tyr serthr ser val pro pro gly gln^he 

GAR ATA GGT AGA GAA CAA ATT GCT GAT CTT GGT GTT GGG GTT GGG GTT CTT GAT GTT AGC ATT 
SlS llJ grj £J gl! glS ile ala asp leu gly vainly val gly val leu asp val*r He 

TAT GAA AAA AAT GGG CAGGTC CAA AAC TAT ACA GTG CCA TAT TCA ACT CCT GTA TTA TCT TTG 
Sr 91" lys Tsn Sly gin val gin asn tyr thrval pro tyr ser thr pro vaMeu ser leu 

fTT RAT GGA TAT TCT°AAA TAT AGT GTA ACT ATT GGT AGA TAC AGG GAG GTT AAC AAT GAT TAT 
pS % gly" tyr lev ly? tyV ser val thMJe gly arg tyr arg glu vaUjsn asn asp tyr 

ATr tat rrT rTT°TTT TTT GAA GGG ACT TAT ATA TAT GGT CTG CCT TAT GGG TTT ACT TTA TTT 
ile £J jK SS 52 Jhl glS gly Sr tyr ile tyr gly leu pro tgnjly phe thr leu phe 

GGT GGA GTGCAA TGG GTA AAT An TAT AAT TCA TAT GCC ATA GGC GCA AGT AAA GAT ATT GGT 
gly Sly val gin Sp val asn ile tyr asn ser tyr ala ile gl^jjla ser lys asp He gly 

GAG TATGGT GCT CTG TCT TTT GAc'tGG AAA ACA TCT GTT TCG AAG ACT GAT ACA TCC AAT GAA 
glS lyr 9ly Si 'eu ser phe asp trp lys thr ser val seMys thr asp thr ser asn |1u q 

AA1°GGT CAT G r A TAT GGG AT? 7 AGA TAC AAT AAA AAT ATC GCT CAG ACA AAC ACC GAA GTA TCT 
«n gfy hll al5 Sr afy ile a?g tyr asn lys asn ileala gin thr asn thr glu valuer 

US all S SJ Ifr €h s « as Sf 2| J? B E » » fC l S 55 
SS ffi K ^ % « « <§B « S S S T T e5 E S S8 K 
US SU 5 flUfe E H K£ 12 ^ S S T P tt gi ko . SI ^£ g1 T y 
«{ « JS 1 ail Sf gf W «l ^ HI 2£ » H S - B E » " 
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FIGURE 2C 



3080 3100 3120 

ACG AAA AGT ACA TCA AAG ATT ACT GM GAA AAT GAA GAT TTA TTC AGT TTT CTA CTC AGT GTA 
thr lys ser thr ser lys ile ser glu glu asn glu asp leu phe ser phe leu leu ser val 

314D 3160 3180 3200 

CCT TTG CAA AAA CH ACA AAT CAT GAA ATG TAT GCT ACA TAT CAA AAC TCA TCC TCT TCA AAG 
pro leu gin lys leu thr asn his glu met tyr ala thr tyr gin asn ser ser ser ser lys 

3220 3240 3260 

CAT GAT ATG AAT CAT GAT TTA GGT ATT ACT GGT GTG GCA TTT AAT AGC CAA TTG ACA TGG CAA 
his asp met asn his asp leu gly ile thr gly val ala phe asn ser gin leu thr trp gin 

3280 3300 3320 

GCA AGA GGG CAA ATA GAA GAT AAA TCG AAA AAT CAA AAG GCT ACA TTT TTA AAT GCT TCT TGG 
ala arg gly gin ile glu asp lys ser lys asn gin lys ala thr phe leu asn ala ser trp 

3340 3360 3380 

CGA GGT ACT TAT GGG GAG ATC GGA GCA AAC TAT AGT CAT AAT GAA ATA AAT CGT GAT ATT GGG 
arg gly thr tyr gly glu ile gly ala asn tyr ser his asn glu ile asn arg asp ile gly 

3400 3420 3440 

ATG AAT GTT TCT GGT GGG GTG ATT GCT CAT TCA TCA GGA ATT ACG TTT GGT CAG AGT ATA TCG 
met asn val ser gly gly val ile ala his ser ser gly ile thr phe gly gin ser ile ser 

3460 3480 3500 

GAT ACT GCT GCA CTG GTA GAG GCT AAA GGT GTA AGT GGG GCA AAA GTT CTG GGC CTA CCA GGT 
asp thr ala ala leu val glu ala lys gly val ser gly ala lys val leu gly leu pro gly 

3520 3540 3560 

GTT AGA ACC GAT TTT AGA GGC TAT ACA ATA TCC AGT TAT CTT ACT CCA TAT ATG AAT AAC TTC 
val arq thr asp phe arg gly tyr thr ile ser ser tyr leu thr pro tyr met asn asn phe 
3580 3600 3620 3640 

ATA TCT ATA GAT CCA ACA ACG TTA CCA ATA AAT ACG GAT ATT AGG CAA ACT GAT ATT CAA GTA 
ile ser ile asp pro thr thr leu pro ile asn thr asp ile arg gin thr asp ile gin val 

3660 3680 3700 

GTT CCT ACC GAA GGT GCT ATT GTA AAA GCT GTA TAT AAA ACA AGC GTG GGT ACT AAT GCA TTA 
val pro thr glu gly ala ile val lys ala val tyr lys thr ser val gly thr asn ala leu 

3720 3740 3760 

ATT AGA ATT ACA AGA ACA AAT GGA AAG CCA CTA GCT CTT GGC ACA GTT CTT TCA CTT AAG AAT 
ile arg ile thr arg thr asn gly lys pro leu ala leu gly thr val leu ser leu lys asn 

3780 3800 3820 

AAT GAT GGA GTA ATC CAA TCA ACA TCT ATT GTT GGC GAA GAT GGT CAG GCA TAT GTA TCT GGA 
asn asp gly val ile gin ser thr ser ile val gly glu asp gly gin ala tyr val ser gly 

3840 3860 3880 

TTG TCA GGA GTG CAA AAA TTA ATC GCT TCG TGG GGG AAT AAG CCC TCC GAT ACT TGT ACA GTT 
leu ser gly val gin lys leu ile ala ser trp gly asn lys pro ser asp thr cys thr val 

3900 3920 3940 

TTT TAC TCT CTT CCC GAT AAA AAC AAA GGT CAG ATT AGC TTT TTA AAT GGA GTG TGC AAA TGA 
phe tyr ser leu pro asp lys asn lys gly gin ile ser phe leu asn gly val cys lys XXX 
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FIGURE 2D 



sefD 

ATG AAT 
met asn 

sefD 

4000 
TTT CTG 
phe' leu 



GAA CTT 
glu leu 



CGT ATT 
arg ile 



AGA GAG 
arg glu 

4180 
AAA AAT 
lys asn 



GCC CAT 
ala his 



GAT AAT 
asp asn 



3960 3980 
CAG TAT AAT TCG TCA ATA CCT AAG TTC ATT GTC TCT GTT 
gin tyr asn ser ser ile pro lys phe ile val ser val 



ATT GTT 
ile val 



AAA TTA 
lys leu 



ACA AGT 
thr ser 

4140 
GGA AAA 

giy lys 



ACT GGT TTT 
thr gly phe 

4060 
ATG ATT AAA 
met ile lys 

4100 

AAT AAA ATA 
asn lys ile 



AAA ATC CTC 
lys ile leu 



4020 
TTC AGC 
phe ser 



ATA AAT 
ile asn 



ATA GGT 
ile gly 



ATT AGT 
ile ser 

4200 
ATT ATT 
ile ile 



TCA ACT ATT 
ser thr ile 



GAG GCT GTT 
glu ala val 

4120 
ACG GGG CAT 
thr gly his 

4160 

TCA AGT TTA 
ser ser leu 



4040 
AAA GCA CAA 
lys ala gin 

4080 

TTT TAT GAC 
phe tyr asp 



GAA 
glu phe 

4360 
ACA TAT 
thr tyr 



ACC CCA GGG GCA TAT 
thr pro gly ala tyr 

4240 

AAG CTT AGG ATA AGA ATA GGT 
lys leu arg ile arg ile gly 

4280 

TCA GGT ATT GGT ATG GTA TCT 
ser gly ile gly met yal ser 

4320 

AAT ATT TAT TTT TTT GGG AAT 
asn ile tyr phe phe gly asn 

4380 

TTA ATA AGC ATA TAT GCG ACA 
leu ile ser ile tyr ala thr 



AGA GGT CAG 
arg gly gin 

GGA GAA GAC 
gly glu asp 

4300 
CAT TCT GAT 
his ser asp 

4340 

GGA GAC ATT 
gly asp ile 



GAA ATT GAA 
glu ile glu 



CTA TTT AAC 
leu phe asn 

GAA AAA ATT 
glu lys ile 

4220 
AAT AAC TCA 
asn asn ser 

4200 

TGG CAA CCA 
trp gin pro 

TTT ACT AAT 
phe thr asn 

CCT GTT GAC 
pro val asp 

4400 
TTA TAA TAA 
leu XXX XXX 



SUBSTITUTE SHEET (RULE 26) 



WO 94/25598 ~ 6/17 PCT/IB94/00207 

FIGURE 3A 

478/11 

TC ACC CAC CCA TTT CTG ATT CGG GCC ACT GGC GTA AAA GCC CTG CTT CAG CAG ATT CTC 
lAGTG G GTG GGT AAA GAC TAA GCC CGG TGA CCG CAT TTT CGG GAC GAA GTC GTC TAA GAG 

OPA gly gly met glu ser glu pro trp gin arg leu leu gly ala glu ala ser glu arg . 

508/21 538/31 

val lys leu ser pro 
sefUl— * 

TGG ACT GGC AGA CCA TGT TCG CGG TAA CTG ACT GGA CTG ATC TTC dGTlfrA GCT TTC GCC 
ACC TGA CCG TCT GGT ACA AGC GCC ATT GAC TGA CCT GAC TAG AAG GCA CTT CGA AAG CGG 

ser gin cys val met asn ala thr val ser, gin val ser arg gly his leu lys arg gly 

568/41 598/51 

ala ala leu pro ala gin ala gly arg tyr gly phe tyr val ile his pro ser leu ser 

CGC AGC ACT GCC GGC GCA GGC TGG CCG CTA CGG TTT HA TGT TAT ACA CCC GTC CCT GAG 
GCG TCG TGA CGG CCG CGT CCG ACC GGC GAT GCC AAA AAT ACA ATA TGT GGG CAG GGA CTC 

cys cys gin arg arg leu ser ala ala val thr lys ile asn tyr val arg gly gin ala 

628/61 658/71 

thr lys leu ile arg gin ala trp arg thr val ala leu phe cys val thr glu cys leu 

CAC GAA GCT CAT CCG TCA GGC GTG GCG TAC CGT AGC GCT GTT TTG CGT CAC TGA ATG CCT 
GTG CTT CGA GTA GGC AGT CCG CAC CGC ATG GCA TCG CGA CAA AAC GCA GTG ACT TAC GGA 

arg leu glu asp thr leu arg pro thr gly tyr arg gin lys ala asp ser phe ala glu 

688/81 718/91 

pro tyr asp val ile thr asp lys ser glu leu leu thr pro asp val pro ala val thr 



CCC GTA CGA CGT TAT CAC AGA CAA GTC GGA ACT GCT GAC GCC GGA CGT ACC AGC TGT TAC 
GGG CAT GCT GCA ATA GTG TCT GTT CAG CCT TGA CGA CTG CGG CTT GCA TGG TCG ACA ATG 

arg val val asn asp cys val leu arg phe gin gin arg arg val tyr trp ser asn arg 

748/101 778/111 

gly asn leu lys tyr thr ala tyr gly phe asp thr glu leu ser leu met phe phe. asp 

GGG CAA CCT GAA GTA CAC GGC ATA TGG CTT TGA TAC TGA ACT CAG CCT GAT GTT TTT CGA 
CCC GTT GGA CTT CAT GTG CCG TAT ACC GAA ACT ATG ACT TGA GTC GGA CTA CAA AAA GCT 

ala val gin leu val arg cys ile ala lys ile ser phe glu ala gin his lys glu ile 

808/121 838/131 

glu asp ile leu his phe arg arg phe ala lys tyr val ala thr ile leu glu asn gly 

TGA AGA CAT ACT TCA TTT CAG GCG TTT CGC GAA GTA TGT CGC GAC CAT TCT GGA GAA TGG 
ACT TCT GTA TGA AGT AAA GTC CGC AAA GCG CTT CAT ACA GCG CTG GTA AGA CCT CTT ACC 

phe val tyr lys met glu pro thr glu arg leu ile asp arg gly asn gin leu ile thr 
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3B 



868/141 



898/151 



gin phe leu ile pro phe cys gin leu thr leu gin thr asp asp phe cys gly his leu 

TCA GTT CCT CAT CCC GTT CTG CCA GTT GAC GCT TCA GAC GGA CGA TTT CTG CGG ACA TCT 

AGT CAA GGA GTA GGG CAA GAC GGT CAA CTG CGA AGT CTG CCT GCT AAA GAC GCC TGT AGA 

leu glu glu asp arg glu ala leu gin arg lys leu arg val ile glu ala ser met glu 



leu phe ala phe arg arg arg glu leu ile leu leu phe ala ser pro val val glu leu 

CCT GTT CGC GTT CAG AAG AAG AGA GCT GAT TTT GCT GTT TGC TTC GCC AGT TGT AGA GCT 
GGA CAA GCG CAA GTC TTC TTC TCT CGA CTA AAA CGA CAA ACG AAG CGG TCA ACA TCT CGA 

gin glu arg glu ser ser ser leu gin asn gin gin lys ser arg trp asn tyr leu gin 



arg phe ile gin val lys leu ala gly ser arg ser his pro asn ala leu ala ser phe 

GCG ATT CAT ACA GGT TAA GCT CGC TGG TAG CCG CAG CCA TCC CAA TGC GTT AGC CAG TTT 
CGC TAA GTA TGT CCA ATT CGA GCG ACC ATC GGC GTC GGTUGG GTT ACG CAA TCG GTC AAA 

sefU2 

ser glu tyr leu asn leu glu ser thr ala ala ala met 
1048/201 1078/211 

lys ala ser ser arg asn ser gly val cys cys leu his gly phe leu val val asp ala 

CAA GGC TTC GTC GCG AAA TTC AGG CGT ATG TTG CTT GCA TGG CTT TTT GGT GGT TGA TGC 
GTT CCG AAG CAG CGC TTT AAG TCC GCA TAC AAC GAA CGT ACC GAA AAA CCA CCA ACT ACG 



1108/221 

ala phe val met OPA 

TGC TTT TGT CAT GEO 
ACG AAA ACA GTA CAC T 



928/161 



958/171 



988/181 



1018/191 
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3303 3312 3321 3330 3339 3348 

ACC GGG GTT TAT CGC TTT ACC TTT GAC AGC GTT CAT CTT TCC GAC GGC GTA CAG 

Arg Gly Leu Ser Leu Tyr Leu *** Gin Arg Ser Ser Phe Arg Arg Arg Thr Val 

3357 3366 3375 3384 3393 3402 

TTT ATC GTC GTG GTG ATC GGC CTG TTC TCG GTA TCA GAA ATA CTT TTA ATG CTG 

Tyr Arg Arg Gly Asp Arg Pro Val Leu Gly He Arg Asn Thr Phe Asn Ala Gly 

3411 3420 3429 3438 3447 3456 

GAA CAT ACC AGC AGC GGG CAA ACA ATG GTC CGC AAA ACG GGT CGA ATG TTG TTC 

Thr Tyr Gin Gin Arg Ala Asn Asn Gly Pro Gin Asn Gly Ser Asn Val Val Gin 

3465 3474 3483 3492 3501 3510 

AAC CTG AAA GAA GGC GCG CAG TGT ATC GGC ACC ACC CTG CGT TCT TCG GTA ATC 

Pro Glu Arg Arg Arg Ala Val Tyr Arg His His Pro Ala Phe Phe Gly Asn Arg 

3519 3528 3537 3546 3555 3564 

GGC TTT TTT GTC GGC GTA TTG CCC GGC GCC GGG CGA CCA TTG CCA GCG CCA TTA 

Leu Phe Cys Arg Arg He Ala Arg Arg Arg Ala Thr He Ala Ser Ala He Thr 

3573 3582 3591 3600 3609 3618 

CCT ATA TGA CCG AGA AAA AAC TCA GCG GCA ACA GCG ATA GCT TCG GCA AAG GGG 

Tyr Met Thr Glu Lys Lys Leu Ser Gly Asn Ser Asp Ser Phe Gly Lys Gly Asp 

3627 3636 3645 3654 3663 3672 

ATA TTC GCG GCG TCG CGG CGC CGG AGG CGG CAA ACA ACG CCT CTG CCT GCG GCT 

He Arg Gly Val Ala Ala Pro Glu Ala Ala Asn Asn Ala Ser Ala Cys Gly Ser 

3681 3690 3699 3708 3717 3726 

CCT TCA TCC CGA TGC TGA CGC TGG GCG TTC CCG GTT CCG GCA CTA CGG CAG TGA 

Phe lie Pro Met Leu Thr Leu Gly Val Pro Gly Ser Gly Thr Thr Ala Val Met 

3735 3744 3753 3762 3771 3780 

TGA TGG GGG CGC TGA CGC TGT ACA ACA TCA CGC CAG GCC CGG CGA TGT TCA CCG 

Met Gly Ala Leu Thr Leu Tyr Asn He Thr Pro Gly Pro Ala Met Phe Thr Glu 

3789 3798 3807 3816 3825 3834 

AAC AGC CGG ATA TCG TCT GGG GAC TCA TCG CTG CGC TGC TGA TTG CGA ACG TGA 

Gin Pro Asp He Val Trp Gly Leu lie Ala Ala Leu Leu lie Ala Asn Val Met 

3843 3852 3861 3870 3879 3888 

TGC TGC TGA TCA TGA ATA TCC CGT TGA TCG GTC TGT TCA CCC GTA TGC TCA CCA 

Leu Leu lie Met Asn He Pro Leu lie Gly Leu Phe Thr Arg Met Leu Thr He 
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3897 3906 3915 3924 3933 3942 

TTC CGC TGT GGT TCC TGG TAC CCG CCA TCG CTG CCG TAT CGG CGG TGG GGG TGT 



Pro Leu Trp Phe Leu Val Pro Ala lie Ala Ala Val Ser Ala Val Gly Val Tyr 

3951 3960 3969 3978 3987 3996 

ATG CGG TAC ACA GCA CCA CCT TCG ATC TGG TGC TGA TGG TCG CGC TCG GCG TGT 



Ala Val His Ser Thr Thr Phe Asp Leu Val Leu Met Val Ala Leu Gly Val Leu 

4005 4014 4023 4032 4041 4050 

TAG GGT ACA TTT TAC GTA AAA TGC ACT TCC CCA TGT CAC CGC TGA TCC TGG GGT 

Gly Tyr He Leu Arg Lys Met His Phe Pro Met Ser Pro Leu He Leu Gly Phe 

4059 4068 4077 4086 4095 4104 

TCG TAC TGG GGG AAA TGC TGG AGC AGA ACC TGC GTC GCG CAC TCT CCA TCA GTA 



Val Leu Gly Glu Met Leu Glu Gin Asn Leu Arg Arg Ala Leu Ser He Ser Asn 

4113 4122 4131 4140 4149 4158 

ACG GCA ATA TGG CGA TTT TGT GGC AAA GCG GCG TTG CCA AAG CCC TGC TGA TCA 

Gly Asn Met Ala He Leu Trp Gin Ser Gly Val Ala Lys Ala Leu Leu He Met 

4167 4176 4185 4194 4203 4212 

TGG CGA TCA TGG TCA TTG TCG TAC CGC CAG TGT TAC GTC TGC TCC GTA AAC ACA 

Ala He Met Val He Val Val Pro Pro Val Leu Arg Leu Leu Arg Lys His Ser 

4221 4230 4239 4248 4257 4266 

GCC GTA AAC CGC AGG TTG ACG CCG GTT AAT TGA CTG CTG AAA TAC GTT GTA CTT 

Arg Lys Pro Gin Val Asp Ala Gly *** Leu Thr Ala Glu He Arg Cys Thr Cys 

4275 4284 4293 4302 4311 4320 

GTC CGG CCT ACG CGC TCA TGT GTC AGG CCG GGC ACA TCC CCG CCA GCA TTC ACT 

Pro Ala Tyr Ala Leu Met Cys Gin Ala Gly His He Pro Ala Ser He His Phe 

4329 4338 4347 4356 4365 4374 

TTC CCC ATA ACG CCT CTC ATT TTA CAC CCC TTC TTG CCG TTG TCA GGC TCG TGG 



Pro His Asn Ala Ser His Phe Thr Pro Leu Leu Ala Val Val Arg Leu Val Ala 

4383 4392 4401 4410 4419 

CGC CGT TAA CCT CAC CCT TTG CAT TGT TAA ATA TTT GTT GTT TTT G 
Pro Leu Thr Ser Pro Phe Ala Leu Leu Asn He Cys Cys Phe *** 
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2735 2744 2753 2762 2771 2780 

5' ATG GAT ACC TGG ATA TAT CTT TCT CAG GGC TTT GCG GTG GCG ATG ACG CCG GAA 
Met Asp Thr Trp He Tyr Leu Ser Gin Gly Phe Ala Val Ala Met Thr Pro Glu 

2789 2798 2807 2816 2825 2834 

AAC CTG GTT ATC GCG TTG ATT GGC TGC TTC GTG GGC ACG ATC GTC GGT CTG CTG 
Asn Leu Val He Ala Leu He Gly Cys Phe Val Gly Thr He Val Gly Leu Leu 

2B43 2852 2861 2870 2879 2888 

CCG GGT CTG GGA CCG ATC AAC GGC GTG GCA ATC TTA CTG CCG CTG GCC TTT GCG 
Pro Gly Leu Gly Pro lie Asn Gly Val Ala lie Leu Leu Pro Leu Ala Phe Ala 

2897 2906 2915 2924 2933 2942 

TTG CAT CTG CCT GCG GAG TCG GCG CTA ATT CTG CTG GCG ACG GTG TAC ATT GGC 
Leu His Leu Pro Ala Glu Ser Ala Leu He Leu Leu Ala Thr Val Tyr He Gly 

2951 2960 2969 2978 2987 2996 

TGT GAG TAT GGC GGC AGG ATC TCC TCC ATA TTG CTC AAC GTC CCC GGC GAT GCG 
Cys Glu Tyr Gly Gly Arg He Ser Ser He Leu Leu Asn Val Pro Gly Asp Ala 

3005 3014 3023 3032 3041 3050 

GCG GCG ATC ATG ACG GCG CTG GAC GGT TAC CCG ATG GCG CAG CAA GGG AAA GGC 
Ala Ala He Met Thr Ala Leu Asp Gly Tyr Pro Met Ala Gin Gin Gly Lys Gly 

3059 3068 3077 3086 3095 3104 

GGC GTA GCG CTG TCG ATT TCC GCA GTC AGC TCA TTT TTC GGT TCA TTA ATC GCT 
Gly Val Ala Leu Ser He Ser Ala Val Ser Ser Phe Phe Gly Ser Leu He Ala 

3113 3122 3131 3140 3149 3158 

ATC GGC GGC ATC An CTG TTC GCC CCT TTA CTG GCG CAA TGG TCG CTG GCC TTT 
He Gly Gly He He Leu Phe Ala Pro Leu Leu Ala Gin Trp Ser Leu Ala Phe 

3167 3176 "3185 3194 3203 3212 

GGG CCG GCG GAA TAT TTC GCC TTA ATG GTT TTT GCC ATC GCC TGT CTT GGC AGC 
Gly Pro Ala Glu Tyr Phe Ala Leu Met Val Phe Ala He Ala Cys Leu Gly Ser 

3221 3230 
ATG ATG GCG CAA AAC CCG GCT TAA 3' 
Met Met Ala Gin Asn Pro Ala *** 
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1301 1310 1319 1328 1337 1346 

5" ATG AAA AAA CM TTA CTT CGT ACC CTT ACT GCA AGC ATT TTA TTA ATG AGT ACC 
Met Lys Lys Gin Leu Leu Arg Thr Leu Thr Ala Ser He Leu Leu Met Ser Thr 

1355 1364 1373 1382 1391 1400 

TCT GTT CTG GCG CAG GAG GCG CCG TCG CGA ACG GAA TGT ATC GCG CCA GCC AAA 
Ser Val Leu Ala Gin Glu Ala Pro Ser Arg Thr Glu Cys He Ala Pro Ala Lys 

1409 1418 1427 1436 1445 1454 

CCT GGC GGC GGT TTC GAC CTC ACC TGT AAG CTG ATT CAG GTG AGT TTG CTG GAG 
Pro Gly Gly Gly Phe Asp Leu Thr Cys Lys Leu He Gin Val Ser Leu Leu Glu 

1463 1472 1481 1490 1499 1508 

ACT GGC GCT ATC GAG AAA CCC ATG CGG GTA ACG TAT ATG CCC GGC GGC GTC GGC 
Thr Gly Ala lie Glu Lys Pro Met Arg Val Thr Tyr Met Pro Gly Gly Val Gly 

1517 1526 1535 1544 1553 1562 

GCT GTG GCC TAT AAC GCG ATA GTC GCC CAA CGC CCT GGC GAA CCC GGG ACT GTG 
Ala Val Ala Tyr Asn Ala He Val Ala Gin Arg Pro Gly Glu Pro Gly Thr Val 

1571 1580 1589 1598 1607 1616 

GTC GCC TTT TCC GGC GGT TCG CTG CTC AAC CTG TCG CAG GGG AAA TTT GGT CGC 
Val Ala Phe Ser Gly Gly Ser Leu Leu Asn Leu Ser Gin Gly Lys Phe Gly Arg 

1625 1634 1643 1652 1661 1670 

TAC GGC GTG GAT GAT GTG CGC TGG CTG GCA AGC GTG GGC ACT GAT TAC GGC ATG 
Tyr Gly Val Asp Asp Val Arg Trp Leu Ala Ser Val Gly Thr Asp Tyr Gly Met 

1679 1688 1697 1706 1715 1724 

ATC GCC GTG CGT GCG GAT TCT CCG TGG AAA ACG CTG AAA GAT CTG ATG ACG GCG 
He Ala Val Arg Ala Asp Ser Pro Trp Lys Thr Leu Lys Asp Leu Met Thr Ala 

1733 1742 1751 1760 1769 1778 

ATG GAA AAA GAT CCC AAC AGC GTG GTC ATT GGC GCT GGC GCC TCT ATT GGC AGC 
Met Glu Lys Asp Pro Asn Ser Val Val lie Gly Ala Gly Ala Ser He Gly Ser 

1787 1796 1805 1814 1823 1832 

CAG GAC TGG ATG AAG TCG GCA TTG CTG GCG CAA AAG GCG AAC GTC GAC CCG CAC 
Gin Asp Trp Met Lys Ser Ala Leu Leu Ala Gin Lys Ala Asn Val Asp Pro His 

1841 1850 1859 1868 1877 1886 

AAG ATG CGC TAC GTT GCC TTT GAG GGC GGC GGC GAG CCG GTG" ACG GCA TTA ATG 
Lys Met Arg Tyr Val Ala Phe Glu Gly Gly Gly Glu Pro Val Thr Ala Leu Met 

1895 1904 1913 1922 1931 1940 

GGC AAC CAT GTT CAG GTT GTC TCC GGC GAT CTC AGT GAA ATG GTG CCT TAT CTG 
Gly Asn His Val Gin Val Val Ser Gly Asp Leu Ser Glu Met Val Pro Tyr Leu 

1949 1958 1967 1976 1985 1994 

GGC GGC GAC AAA ATC CGC GTG CTT GCC GTC TTT TCA GAA AAT CGT CTG CCG GGC 
Gly Gly Asp Lys He Arg Val Leu Ala Val Phe Ser Glu Asn Arg Leu Pro Gly 
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2003 2012 2021 

CAG CTT 6CC AAT ATC CCT ACC GCT AAA 
Gin Leu Ala Asn He Pro Thr Ala Lys 

2057 2066 2075 

ATT ATT CGC GGC TTC TAC GTC GGG CCC 
He He Arg Gly Phe Tyr Val Gly Pro 

2111 2120 2129 

TGG GTG GAT ACC TTC AAG AAG CTC CAG 
Trp Val Asp Thr Phe Lys Lys Leu Gin 

2165 2174 2183 

GAT CTG CGC GGC CTG TTT GAG TTC GAC 
Asp Leu Arg Gly Leu Phe Glu Phe Asp 

2219 2228 2237 

GTG AAA AAA CAG GTT ACT GAT TAC CGT 
Val Lys Lys Gin Val Thr Asp Tyr Arg 





2030 








GAA 


CAG GGG 


TAC GAC CTG 


GTG 


TGG CCG 


Glu 


Gin Gly 


Tyr Asp Leu 


Val 


Trp Pro 




2084 


2093 






AAA 


GTC AGC 


GAT GCC GAT 


TAC 


CAG TGG 


Lys 


Val Ser 


Asp Ala Asp 


Tyr 


Gin Trp 




2138 


2147 




2156 


CAA 


ACC GAC 


GAG TTT AAA 


AAG 


CAG CGC 


Gin 


Thr Asp 


Glu Phe Lys 


Lys 


Gin Arg 




2192 


2201 




2210 


ATG 


ACC GGC 


CAG CAG CTC 


GAT 


GAC TAC 


Met 


Thr Gly 


Gin Gin Leu 


Asp 


Asp Tyr 




2246 


2255 




2264 


GAA 


CAG GCG 


AAA GCC TTC 


GGA 


CTC GCG 


Glu 


Gin Ala 


Lys Ala Phe 


Gly 


Leu Ala 



AAA TAA 3' 
Lys *** 



SUBSTITUTE SHEET (RULE 26) 



WO 54A25598 PCT/IB94/00207 

13/17 

FIGURE 7A 



•10 * *50_ 



cnr rt& ffA CAR Tfifi fiflC GGC GGC GGT AAT CAT AAC GGC GGC GGC AAT AGT TCC GGC CCG 
CAG CAT GGT GTC ACC CCS CCG CCG CCA TTA GTA TTG CCG CCG CCG TTA TCA AGG CCG GGC 
val val pro gin trp gly gly gly gly asn his asn gly gly gly asn ser ser gly pro 
2 5 10 15 20 



. • • -100 

GAC TCA ACfi TTG AGC ATT TAT CAG TAC GGT TCC GCT AAC GCT GCG CTT GCT CTG CAA AGC 
CTG AGT TGC AAC TCG TAA ATA GTC ATG CCA AGG CGA TTG CGA CGC GAA CGA GAC GTT TCG 
asp ser thr leu ser ile tyr gin tyr gly ser ala asn ala ala leu ala leu gin ser 
25 30 35 40 



•150 

GAT GCC CGT AAA TCT GAA ACG ACC ATT ACC CAG AGC GGT TAT GGT AAC GGC GCC GAT GTA 

CTA CGG GCA TTT AGA CTT TGC TGG TAA TGG GTC TCG CCA ATA CCA TTG CCG CGG CTA CAT 

asp ala arg lys ser glu thr thr ile thr gin ser gly tyr gly asn gly ala asp val 
45 50 55 60 



•200 .... 
GGC CAG GGT GCG GAT AAT AGT ACT ATT GAA CTG ACT CAG AAT GGT TTC AGA AAT AAT GCC 
CCG GTC CCA CGC CTA TTA TCA TGA TAA CTT GAC TGA GTC TTA CCA AAG TCT TTA TTA CGG 
qlv qln qly ala asp asn ser thr ile glu leu thr gin asn gly phe arg asn asn ala 
s J 65 70 75 80 

•250 .... -300 

ACC ATC GAC CAG TGG AAC GCT AAA AAC TCC GAT ATT ACT GTC GGC CAA TAC GGC GGT AAT 
TGG TAG CTG GTC ACC TTG CGA TTT TTG AGG CTA TAA TGA CAG CCG GTT ATG CCG CCA TTA 
thr ile asp gin trp asn ala lys asn ser asp ile thr val gly gin tyr gly gly asn 
85 90 95 100 



•350 

AAC GCC GCG CTG GTT AAT CAG ACC GCA TCT GAT TCT GAC TCT TAT ACA CAA GTA GCG TCC T 
TTG CGG CGC GAC CAA TTA GTC TGG CGT AGA CTA AGA CTG AGA ATA TfiT (TTT CAT CGC AGG A 
asn ala ala leu val asn gin thr ala ser asp ser asp ser tyr thr gin val ala ser 
105 110 
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1/1 31/11 

ATG AAA CTT TTA AAA GTG GCA GCA TTC GCA GCA ATC GTA GIT TCT GGC AGT GCT CTG GCT 
Met lys leu leu lys val ala ala phe ala ala ile val val ser gly ser ala leu ala 

61/21 91/31 

GGC GTC GTT CCA CAA TGG GGC GGC GGC GGT AAT CAT AAC GGC GGC GGC AAT AGT TCC GGC 
gly val val pro gin trp gly gly gly gly asn his asn gly gly gly asn ser ser gly 

121/41 151/51 

CCG GAC TCA ACG TTG AGC ATT TAT CAG TAC GGT TCC GCT AAC GCT GCG CTT GCT CTG CAA 
pro asp ser thr leu ser ile tyr gin tyr gly ser ala asn ala ala leu ala leu gin 

181/51 211/71 

AGC GAT GCC CGT AAA TCT GAA ACG ACC ATT ACC CAG AGC GGT TAT GGT AAC GGC GCC GAT 
ser asp ala arg lys ser glu thr thr ile thr gin ser gly tyr gly asn gly ala asp 

241/81 271/91 

GTA GGC CAG GGT GCG GAT AAT AGT ACT ATT GAA CTG ACT CAG AAT GGT TTC AGA AAT AAT 
val gly gin gly ala asp asn ser thr ile glu leu thr gin asn gly phe arg asn asn 

301/101 331/111 

GCC ACC ATC GAC CAG TGG AAC GCT AAA AAC TCC GAT ATT ACT GTC GGC CAA TAC GGC GGT 
ala thr ile asp gin trp asn ala lys asn ser asp ile thr val gly gin tyr gly gly 

351/121 391/131 

AAT AAC GCC GCG CTG GTT AAT CAG ACC GCA TCT GAT TCC AGC GTA ATG GTG CGT CAG GTT 
asn asn ala ala leu val asn gin thr ala ser asp ser ser val met val arg gin val 

421/141 451/151 
GGT TTT GGC AAC AAC GCC ACG GCT AAC CAG TAT TAA 
gly phe gly asn asn ala thr ala asn gin tyr OCH 
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